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Chapter 1 


Syllable structure and loanword 
adaptation in Frò?ò 


Yranahan Traoré 


Caroline Féry 
University of Frankfurt 


This article examines the syllable structure in Fró?6, a dialect of Tagbana spoken 
in Côte d'Ivoire. In our analysis, the underlying syllable structure in Frd?0 is lim- 
ited to C(C)V and V. Other surface syllable shapes, such as CVC, are the result of 
synchronic morphophonological processes. These processes include the formation 
of surface complex onsets through vowel deletion, the simplification of underly- 
ing complex onsets through liquid deletion, and the merger of bisyllabic CVCV 
sequences into monosyllables (CVC and CV). Evidence of these phonological pro- 
cess can also be found in loanwords, where syllable repairs take place. 


1 Introduction 


This article studies the syllable structure in Fró?ó (Tagbana), a Senoufo (Gur) 
language of Cóte d'Ivoire (see Clamens 1952, Manessy 1962, Herault & Mlanhoro 
1973, Miehe 2012, Miehe et al. 2012) and the effects that syllabic restrictions have 
on loanword adaptation. 82 introduces the underlying syllable structure and the 
basic phonotactic rules of Fro?0. 83 discusses three resyllabification processes. 
First, two kinds of vowel deletion are introduced: one leading to coda emergence, 
and another one leading to complex onsets. The second resyllabification process 
is liquid deletion leading to onset simplification. $4 examines the process of merg- 
ing two monosyllabic morphemes into a single syllable. 85 focuses on the way 
phonotactic restrictions influence loanword adaptations. $6 summarises and con- 
cludes. 


In Samson Lotven, Silvina Bongiovanni, Phillip Weirich, Robert Botne & Samuel Gyasi Obeng 
(eds.), African linguistics across the disciplines: Selected papers from the 48th Annual Conference on 
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Before turning to syllable structure, let us briefly introduce the phonemic in- 
ventory of Fró?6, its lexical and grammatical tones and the nominal class sys- 
tem. The consonants are shown in Table 1. There are 22 consonants, 10 of which 
are stops and two are voiceless fricatives, but there is no voiced fricative. The 
10 stops are divided into voiceless and voiced ones with five places of articula- 
tion: labial, alveolar, palatal, velar and labio-velar. Two laryngeal obstruents are 
present as well: [?] and [h]. Additionally, there are six sonorants, four of which 
are nasals. The remaining sonorants are two glides, [j] and [w], and two liquids, 
[1] and [r]. The Fró?ó consonant system is close to that of other Gur languages, 
although some differences emerge as well. For instance, voiced fricatives have 
been shown to exist in other Gur languages. 


Table 1: Fró?ó consonants 


labial alveolar palatal velar labio-velar glottal 


; voiceless t c k k ? 
E voiced b d j g E 
Fricative f S h 
Nasal m n n I 
Glide j w 
Lateral l 
Rhotic r 


Frò?ò has seven oral vowels that can be long in some environments, in partic- 
ular before a heteromorphemic [r] or [1]. All vowels have nasal correspondents, 
except for the mid [+ATR] ones, [e] and [o], that are never nasalized; thus all in 
all the language shows 12 vowels, as shown in Figures 1 and 2. In this article, IPA 
is used (Frò?ò has no written system). 


Wa 
o 

AE 

e 


m 
or 


u 
o 
2 ———S_ en, 
a a 
\ 


Figure 1: Short vowels Figure 2: Nasal vowels 


1 Syllable structure and loanword adaptation in Frò?ò 


Frò?ò has three level tones, high (H), mid (M) and low (L). There are no contour 
tones. The syllable is the tone bearing unit (TBU) and every syllable in every 
word carries its own tone, regardless of the category and the length of the word. 
In other words, all vowels in the language bear one of these three tones. Some 
examples of (near-) minimal pairs appear in (1).! 


Table 2: Tonal minimal pairs 


H M L 


hü-mà7 ‘oil’ hü-mü7 ‘worship’ hi-mi7 ‘the drink’ 


pal ‘monitor lizard 
fal ‘bamboo tree 


fié:ré6 ‘the sham’ 


pa ‘to come’ 
fa ‘to build’ 
7105 ‘house’ 
k515 ‘to cough’ 


jó ‘to carry’ 
k$-15 ‘monkey-cmv’ 
péré ‘to sell’ 


Tones play a large role in the grammatical domain, as exemplified in (1), where 
a change in grammatical tone signifies a change in tense/aspect/mood of the 
utterance. 


(1) Tonal changes due to aspect changes 
a. ki ma gla 
PRO.OBJ PRO-2sG please 
‘It will please you' 
b. kí glá 
PRO.OBJ PRO-2SG like 
‘Did it please you?’ 
c. ki glà 
PRO.OBJ PRO-2SG like 
‘Make it please you!’ 


ma 


ma 


Combinations of tones in the nominal domain reveal the existence of floating 
tones in part of the vocabulary, compare (2a) with (2b). The high tone on the verb 
sis in (2b) is the result of a floating tone on the noun (marked with a superscript 
H) that spreads up to the verb.” 


‘Every noun belongs to a nominal class, see below. In the examples in Table 2, the numbers “7” 
and “1” indicate that the nouns belong to class 7 or class 1. The word ki-l5 ‘monkey’ consists 
of a lexical root and a class marker of class 1. 

"Thanks to Annie Rialland for working this out with us. 
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(2) Floating tones 
a. ato:lo an  — àtò:lò si5 
spoon-cM2 to buy 
‘to buy a spoon’ 
b. àblo?0 vam ` —» àblò?ò sí 
peanut-CM5 to buy 
'to buy a peanut? 


Nouns are the result of a sequence of a lexical root and an overt or a covert 
class marker (CM). Table 3 provides an overview of the nominal classes in Fró?ó 
(see Traoré and Féry, unpublished, for nouns and nominal classes). Every noun 
belongs to one of seven nominal classes, which are classified according to the 
phonological form of their class marker and/or associated functional morphemes, 
ie. the morphemes in an agreement relation with the head noun (see Corbett 
1991). Six of the classes form gender pairs of singular and plural, but the nouns 
in class 7 are mass nouns or nouns denoting properties. This class has no corre- 
sponding plural. When the CM is covert, we do not indicate it as a CM in the 
gloss. The lexical root is written with its class instead, e.g., pd1 ‘monitor lizard’ 
and wôti51 ‘python’. 


2 Underlying syllable structure and phonotactics 


Two underling syllable structures are present in Fró?6: the first type of syllable 
consists only of a nucleus (V or [n]), and the second type consists of an onset + 
nucleus, see (3). In the latter case, the onset can be simple or complex, consisting 
of maximally two consonants, thus (C)CV. 


(3) Fró?ó underlying syllable structures 


2.1 V syllable: Nucleus only 


As far as syllables consisting of only a nucleus are concerned, only [a] can start a 
word as illustrated in (4), see Herault & Mlanhoro (1973) for the same observation 
for Tákpér, another dialect of Tagbana; [a] can be oral or nasal.? 


The only exception is the interjection &.hé ‘yes, ok’. 
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Table 3: Overview of the nominal classes of Frò?ò and their class mark- 


ers 


Class markers (CM) 


Examples of nouns of each class 


Class 1 (sg. of gender 1) 
several CM, including 2 


Class 2 (pl. of gender 1) 
CM: [-hele], [-bele], -IV 


Class 3 (sg. of gender 2) 
CM: [-1V] 


Class 4 (pl. of gender 2) 
CM: [-?VIV, -gele] 


Class 5 (sg. of gender 3) 
CM: [-gV]/[-9V]/[-?V] or Ø 


Class 6 (pl. of gender 3) 
CM: [-rV] 


Class 7 (sg. of gender 4) 
CM: [-mV] 


(4) [a]/[à] in word-initial position 
a. àjle-?e 
mirror-CM5 
‘mirror’ 
b. à.wre-?e 
something itchy-cm5 
‘something itchy’ 


ho-lo 
elephant-cm1 
ho-belé 
elephants-cm2 
la:-la 
belly-cm3 
la-?alà 
bellies- cM4 
jē-gē 
month-cm5 
je:-re 
months-CM6 


pü-mü 
water-CM7 


c. à.plè3 
‘shade’ 
d. a.gul 


wotis1 
python 
wotis—helé 
pythons-cm2 
kpé-lé 
knife-CM3 
kpe-gele 
knives-cM4 
af5-95 
newthing-CM5 
af5:-r5 
newthings-CM6 


wē-bē 
foliage-cm7 


‘traditional dance’ 


e. àgo-ló 
mount-CM3 
‘mount’ 


Word-medially, all vowels can be a nucleus, see two examples in (5), each of 
which contains a CM consisting only of a vowel. 


(5) Vowel at hiatus position 
a. pi-3 
child-cm1 
‘child’ 


b. kä.fü-6 


sweat-CM5 
D 2 
sweat 
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Word-initially, before all vowels other than [a], [h] or another consonant is 
needed; see (6) for words starting with [h]. In loanwords starting with a vowel, 
[h] is inserted word-initially, see §5. 


(6) 


[h] initial words 
a. 


bere 
‘to press’ 
bai? 
‘to cook’ 
ho? 
‘to stoop’ 


. hi-?i 


feather-CM5 
‘feather’ 


. hú-?ú 


thorn-CM5 
‘thorn’ 


Syllables consisting of a nasal only are the subject of §2.4 


2.2 CV syllable: onset + nucleus 


All consonants can occupy the word-initial onset position except for the glottal 
stop [?] and [r], both of which do not occur in this position. In (7), monosyllabic 


words are used for illustration. 


(7) 


pul 
‘dog’ 
ba7 
‘this’ 
tol 
‘father’ 
di: 

‘so, that’ 
cá 

‘to fall’ 
jè 

‘to wake up’ 


. kā 


‘to break’ 


. gul 


‘tortoise’ 


i. kpē 


‘to take’ 


j. gbol 


‘gnat’ 


. fa 


‘to build’ 


. SE 


‘produce’ 


. hé 


‘where’ 


. mi 


‘I, me’ 


o. nul 
ox 
p. ni 
‘to fill’ 
q. a 
‘this one’ 
r. jo 
‘to say’ 
S. Wl 
‘him’ 
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Vowel lengthening is triggered by a following liquid, [r] or [1], as shown in (8). 
Liquids at the beginning of word final syllables often are the initial consonant 
of a class marker, but not always. The examples in (8) have a heteromorphemic 
liquid, except for (8f), in which the last syllable is part of the lexical root. 


(8) a. loro d. pü:-lù 
mango-CM6 dog-cm2 
‘mangoes’ ‘dogs’ 

b. ka:-la BEE 
EE child-cm2 
p d children" 

c. pi-ri 
tam-tam-CM6 f. jà.ràl 
‘tam-tams’ ‘lion’ 


Not all vowels lengthen before a liquid, as shown in (9). This happens when 
the vowel follows [?]. In this case, it is deleted or pronounced as a short and weak 
vowel (see §3.2 for vowel deletion). Thus, the sequence [?VrV] blocks lengthening 
of the vowel following [r]. 


(9) a. firí c. n$?5.15 
‘to frighten’ ‘to move’ 

b. hi?iri d hü?ü.rü 

‘to shiver’ ‘to spin’ 


2.3 CCV syllables: complex onset + nucleus 


Consonant clusters in the onset are quite common in Fró?ó. They are mostly 
regulated by the sonority sequencing principle, which states that the sonority of 
segments rises toward the nucleus of a syllable and lowers away from it (see for 
example Clements 1990 for this principle). Stops are the least sonorous segments 
and low vowels the most sonorous ones, as illustrated in (10). 


(10) Sonority hierarchy 
Stops Fricatives Nasals Liquids Glides High vowels Low vowels 
> 


In Frd?0, the maximum number of consonants in the onset is two. Nearly all 
consonants, except for [s], [h], [r] and the glottal stop [?], can be followed by 
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[1] and [r]; even the glides [w] and [j] can form complex onsets with a liquid, in 
violation of the sonority principle, since glides are more sonorous than liquids. 
There is a restriction against a sequence of two coronals if the second one is the 
lateral, thus [tl], [dl], [nl] and [rl] do not occur as complex initial onsets. 

Table 4 lists all possible word-initial complex onsets. 


Table 4: Complex onsets 


pb t d c ; k g kp gh f s h ? m n p y w j 

m + 
l + — + + + + + — 4 - — + + + + 
r + + + + + + + + = + + + 


Words initial complex onsets are illustrated in (11). 


(11) Complex word-initial onsets 


i. 


[pl]: pl3.23 


bamboo-cM5 


ix. [jl]: ji A 


‘wise’ 
bamboo’ x. [jr]: jrè.?é 
ii. [pr]: pró6 ‘to fly’ 
‘chip’ xi. [kl]: 15.24 
iii. [bl]: bi seat-CM5 
‘plowed’ ‘seat’ 
iv. [br]: bré.?é xii. [kr]: kr».?5 
‘to boil’ car-CM5 
v. [tr]: trá.?á car 
‘to stick up’ xiii. [km]: kms 
vi. [dr]: drè.?è ‘to beat 
shift-CM5 xiv. [gm]: gm$ 
‘shirt’ ‘beaten’ 
vii. [cl]: clé.mi xv. [gl]: glé.?è 
woman-CM7 tamis-CM5 
‘womanhood’ ‘tamise’ 
viii. [cr]: crē.?ē xvi. [gr]: gra 
‘to expand’ ‘dirty’ 
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xvii. [kpl]: kplé.?é 
former-cM5 
‘former’ 
xviii [kpr]: kprä.?à 
sugar cane-CM5 
'sugar cane' 
xix. [gbl]: gblè:r 
"beginning 
xx. [gbr]: gbrè.?è 
‘unripe’ 
xxi. [fl]: On 
furuncle-cm5 
‘furuncle’ 
xxii. [fr]: fr5.?5 
‘to scrub’ 
xxiii [sr]: sré.?é 
prayer-CM5 
‘prayer’ 
xxiv. [hl]: hla-?a 
leg-cM5 
‘leg 
xxv. [hr]: hro6 
meal 
‘meals’ 
xxvi. [ml]: mlà-?à 
fight-cm5 
‘fight’ 


xxvii [mr]: mrü.?ü 
corosol-CM5 
*corosol' 


xxviii [nr]: nr 
root-CM6 
€ > 

roots 


xxix. [nl] nl5.25 
‘to write’ 


xxx. [nr]: prá.?á 
‘to hook’ 


xxxi. [pl]: 915-25 
dream-cM5 


H 


‘dream 


xxxii [nr]: gr5.?25 
'to push' 


xxxii. [wl]: wlè-?è 
day-CM5 


xxxiv. [wr]: wré.?é 
‘short’ 


xxxv. [jl]: jlé.mé 
clean-cM7 
‘cleanliness’ 


xxxvi. [jr]: jra.?a 
thing-CM5 
‘thing’ 


Only [k] and [g] can form an initial complex onset with [m], but the segment 
clusters [km] and [gm] are only attested in a few words. Herault & Mlanhoro 
(1973) analyze them as the nasal counterparts of [kp] and [gb] when the follow- 
ing vowel is nasal, see examples in (12). In other words, in their analysis [km] 
and [gm] are allophones of underlying /kp/ and /gb/. An alternative explanation 
is that the vowel following [km] and [gm] is subject to nasal harmony. That ex- 
plains why the following vowel is always nasal. In fact, [kp] and [gb] can be 
followed by a nasal vowel, and this distributional fact speaks against the allo- 
phonic nature of [km] and [gm], see the examples in (12). Word-medial complex 


onsets are listed in (14). 
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(12) 


(13) 


(14) Word-medial complex onsets 


10 


a. km 
‘to hit’ 


b. kmà-?à 
‘nice’ 


a. kpë-mû 
daylight-cm7 
‘daylight’ 

b. kpádà:-là 

funeral yard-cw3 
‘funeral yard’ 


i. 


iii. 


vi. 


vii. 


[pl] 


[bI]: 


[br]: 


[tr]: 


: ti.pl5-75 


ground-CM5 
‘ground’ 


i [pr]: ti.pri-?i 


fresh-cM5 
‘freshness’ 
à.bló-?ó 
peanut-CM5 
‘peanut’ 
à.brà.?à 

‘to bargain’ 


kpa.tr6-76 
whip-cm5 
‘whip’ 


: ka.fi.n.dri-7i 


umbilical cord-cM5 
‘umbilical cord’ 


: ka.clé-?é 


bone-cM5 
‘bone’ 


c. km$.25-15 
fireplace-cm3 
‘fireplace’ 

d. la.gmà.mü 
belly-kind-cm7 


‘kindness’ 


c. lagba-mt 
anger-CM7 
‘anger’ 


viii. [jl]: n.jlé.?è 

wall-cm5 
‘wall’ 

ix. [jr]: ko.jré.lé 
mangoose-CM3 
‘mangoose’ 

x. [kl]: ti.kl3.25 
‘long’ 


xi. [kr]: ka.kra.? 
junk-cM5 
‘junk’ 
xii. [km]: a.km5:-r5 
strike-CM6 
‘the strikes’ 
xiii [gm]: la.gmà-m(ü) 
belly-kind-cm7 
‘kindness’ 
xiv. [gl]: hà.gla.7à 
bow-cM5 
‘bow’ 
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xv. [gr]: bà.grà-?à xxii. [nl]: má.pà-?à 
hoe-cM5 ember-CM5 
‘hoe’ ‘ember’ 
xvi [gbl]: ka.gblo-76 xxiii. [jl]: a.jle-?e 
cudgel-cm5 mirror-CM5 
‘cudgel’ ‘mirror’ 
xvii. [gbr] à.gbre.?e xxiv. [nr]: à.nr5-95 
unripe-CM5 en-CM5 
‘unripe’ ‘pen’ 
xviii. [fl]: tū.flē-?ē xxv. [hr]: ka.hre-?é 
wind-CM5 cashew-apple-cm5 
‘wind’ 'cashew-apple' 
xix. [fr]: ka.fre.?e xxvi. [wl]: kū.wlè.?è 
'to apologize' ‘before yesterday’ 
xx. [ml]: na.ma.?4 xxvii. [wr]: a.wré-?é 
'name of person' something itchy-cm5 


'something itchy' 
xxi [mr]: kámr5-?5 
agouti-CM5 
‘agouti’ 


2.4 Syllabic nasals 


As mentioned above, Frò?ò has tautosyllabic complex onsets involving the nasals 
[n], [m], [n] and [5], which can be followed by liquids, except for [nl] which has 
been shown above to be excluded on other grounds. These clusters obey the 
sonority sequencing principle. Some additional examples appear in (15). 


(15) Nasal-liquid onsets 


a. [ml]: mlà-?à b. [mr]: mrù-?ù c. [nr]: nré6 
war-CM5 corosol-CM5 ‘root’ 
‘war’ *corosol' 


In other cases, a nasal consonant precedes a less sonorous segment. In this 
case, the nasal makes a syllable on its own: syllables in Fró?ó may consist of a 
nasal consonant and nothing else. There is no prenasalized obstruent in Frò?ò. 
These syllables only appear in word-initial position. These nasals are followed 
by a stop (or an affricate) in the next syllable's onset, see (16) for examples. 


11 
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(16) Word-initial syllabic nasal consonants 


a. n.klo C. De e. n.glé3 
‘name of person’ ‘name of person’ pestle 
‘pestle’ 
b. n.da(ha) d. njle-?e f. m.bio 
‘name of person’ wall-cm5 ‘ship’ 
‘wall’ 


The structure of a nasal syllable followed by another syllable is shown in (17). 
It is also relevant when the second syllable has a complex onset, like in n.klo 
‘name of person’. 


(17) Syllabic nasal 


word 


When the stop following the nasal is voiced, the initial syllabic nasal assim- 
ilates to the articulator of the following stop. Before bilabial [b], the nasal is 
labial [m]; before [d] and [j], it is coronal [n]; and before the voiced dorsal [g] 
and labio-dorsal [gb], it is dorsal [n]. Table 5 shows the possible combinations 
between nasals and voiced stops word initially. 


Table 5: Syllabic nasal * voiced stop 


b d i g gb 
m + - - B - 
n - + + - - 
I - - - + + 


The feature filling process in Figure 3 assumes that the nasal consonant [N] 
is underlyingly unspecified for its place of articulation and that it acquires an 
articulator from the following stop. However, it could also be the case that the 
articulator of the nasal is specified as [coronal] and that it loses its coronality as 
a result of spreading of the articulator node. 


12 
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SL SL 
[nasal] Articulator Articulator [-continuant] 
[a] 


Figure 3: Consonant cluster with a nasal, where [x] stands for any plo- 
sive segment 


3 Resyllabification 


In some cases, the underlying syllable structures that were described in $2 are 
subject to modification. This section deals with the following resyllabification 
processes: vowel deletion of the second vowel in a sequence of two vowels in 
two adjacent syllables, leading to CVC syllables (83.1); vowel deletion of the first 
vowel in two adjacent syllables leading to CCV syllables (83.2); and liquid dele- 
tion, a process of onset simplification (83.3).* Furthermore, Fró?ó also has a mor- 
phophonological process: two monosyllabic function words are fused into a sin- 


gle syllable, see $4. 


3.1 Vowel deletion leading to CVC syllables 


If some conditions are met, when two morphemes come together, the last vowel 
of the first morpheme or word deletes, and a coda is created. We refrain from call- 
ing the process “vowel syncope’, since that would imply that it is an unstressed 
vowel that is deleted. However, there is no evidence whatever for stress in Fró?ó. 
Vowel deletion in Fró?ó touches all vowels equally.” 


(18) Coda creation through vowel deletion 
CV.CV — CVC 


“Vowel deletion and liquid deletion have also been documented for Syer, another Gur language, 
see Dombrowsky-Hahn (2015). 

“High vowels are not deleted more easily than mid or low vowels, as is the case in the neigh- 
boring Bamana, Colloquial Bambara (see Green et al. 2014). 
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The result of vowel deletion appears in the third column of the examples in 
(19), showing a sequence of two nouns, or of a noun (or a verb) and an article or 
a class morpheme. 


(19) Vowel deletion in the second syllable 


ti.cé.re + DI —  ticer-f5-l5 
‘madness’ ‘owner-CM1 ‘mad person-cM1' 
ti-71 + ka —  ti?-kà 

'tree-CM5' IND.ART. ‘a tree’ 

fé:.ré + mu — fér-mt 

‘happy person’ CM7 ‘happiness-cm7’ 
ta.?a + lV — t47-à 

*to walk CM3 *walk-cM3' 

klò.?ó + IV —  kló?-ló 

‘to roll’ cM3 ‘fact of rolling-cm3’ 
pé:.ré + IV — pé:r-lé 

‘to sell’ CM3 *sale-CM3' 


In (20), the vowel deletion process takes place three times. 


(20) ka-?a + £5.15 + kr3?3 +ka — 
'village-CM5' ‘chief-cm? 'car-CM5'  IND.ART. 
kà?-f51-kr5?-kà 
‘chief of village's car’ 


The application of vowel deletion and subsequent coda creation are subject to 
three restrictions. First, the deleted vowel is identical to the preceding one. This 
happens when a CM is added to a lexical root, in which case it is the result of 
vowel harmony. The CMs are of the form CV with a specified C, dependent on 
the class of the noun, and an unspecified V, which is a total copy of the last vowel 
of the root. 

In case the final vowel is prespecified and is thus not the result of vowel har- 
mony, the final vowel cannot delete. The words in (21) are lexical roots that do 
not take an overt CM. Moreover, the second vowel is not identical to the first 
one. 
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(21) a. jù.gd5 b. dà.gól c. nü.bü.ói d. wó.ti 
‘head’ ‘sheet’ ‘friend’ ‘python’ 


The second restriction is that only consonants that start a CM can become 
the coda of a syllable. As a result, the sonorants [m, L r, n], the voiced stops [b] 
and [g], and the glottal stop can appear in a coda after vowel deletion. All other 
consonants cannot. 

Third, vowel deletion is blocked at the end of a sentence, see the examples in 
(22). In both cases, the last vowel would delete in the sentence-internal position, 
but fails to do so in sentence-final position. The words ending in a consonant in 
these examples have lost their final vowel, which is always a copy of the preced- 
ing vowel. 


(22) a. kàjíó-rda wō la de? pé ná tí hó?ó 
camps-CM6 REL.PRO PRO.1PL ASP show PRO.3PL to PRO.3PL burn 


‘The camps that we showed to them have burned’ 


b. tr? gā kí tō nàdó-?  nà kí ni 
tree-CM5 REL.PRO PRO.35G fall down yam-cM5 on PRO.3SG AUX 
kpa gba -nà 
big-cM5 


"The tree that has fallen down on yam is big: 


3.2 Vowel deletion leading to CCV syllables 


In (23), vowel deletion applies in the first syllable of a lexical root. In such a case, 
the deleted vowel is not that of a CM, but that of the root. However, it is again 
identical to the following vowel. The first example (23a) shows a lexical root 
plus a class marker playing the role of a nominalizer suffix. The second example 
(23b) shows a derived noun plus an additional class marker playing the role of 
a derivational suffix. It is conspicuous that this second vowel deletion concerns 
lexical root vowels. In Fró?6, as in other West African languages, some vowels 
may be called “weak” and are subject to deletion in the right context (see Sande 
2017 for similar effects in Guébié, a Kru language). Weak vowels contrast with 
"strong" vowels that never delete. 


Green et al. (2014) show that in Bamana, syncope results in high sonority codas, but low sonor- 
ity ones are excluded. This is not always the case in Frò?ò, although high sonority codas are 
more frequent. 


15 


Yranahan Traoré & Caroline Féry 


(23) Vowel deletion in the first syllable 


wé.lé + ?V —  wlé-?é 

‘to look’ CM5 ‘look-cmM5’ 
‘the look’ 

cé-lé + mu — dclemü 

D 5 € 3 

woman-cM1 CM7 woman-CM7 
D 5 
womanhood 


3.3 Liquid deletion 


Another resyllabification process in Fró?ó involves liquid deletion in the second 
position of a complex onset. The syllable structure is simplified as shown in (24): 
only the less sonorous first consonant C is preserved. 


(24) Liquid deletion 
CIV / CrV — CV 


Liquid deletion takes place in the first morpheme in a sequence of two adjacent 
morphemes, the examples in (25) show lexical roots plus CM. In this case, liquid 
deletion is restricted to identical vowels separated by a CM starting with [r], as 
illustrated. 


(25) Vowel deletion in lexical roots plus cm 


a. dré + cM6 — dé-ré 
bee ‘bees’ 
b. kabré + CM6 —> kabé:ré 
grasshopper ‘grasshoppers’ 
c. kra + CM6 — kà:-rà 
file ‘files’ 


Another context in which liquid deletion applies is a sequence of a noun and 
an adjective, see (26). These words are compounds consisting in two lexical roots, 
a nominal one and an adjectival one. The class marker is not always there, but if it 
is required, it is located at the end of the compound. Note that in (26a) neither pls 
nor plé requires a CM in isolation, and the resulting compound has no CM. The 
same holds for other compounds in (26). The vowels do not need to be identical. 
The first consonant of the adjectives is sometimes voiced. It is not clear what 
triggers voicing. 
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(26) Liquid deletion in compounds 


a. pl + plè —  p5-ble 
bamboo small ‘small bamboo’ 
b. kr + plè —  k5-plé 
car small ‘small car’ 
c. hla + k — ha-gl5 
leg long ‘long leg’ 
d. käm + kp»  — kámo-gbà-2) 
agouti big ‘big agouti-CM5' 
e. kahré + kéré —  kahé-géré 
cashew field ‘field of cashew’ 
f Ho + kp5 —  ko-gb5-7?5 
road big ‘big road-cM5’ 


g. ka.clé + kpd —  kà.ce-gb5-?5 
bone big ‘big bone-cm5’ 


Liquid deletion also takes place if the compound is preceded by yet another 
lexical root, see examples in (27). 


(27) Liquid deletion in nominal compounds 


a. pio +  po-blé —  pio.po.blé 

child small bamboo ‘child’s small bamboo’ 
b. tō + k5.plè —  tô.ks.plè 

father small car ‘father’s small car’ 


Liquid deletion in monosyllabic nouns without a CM is not possible. The words 
in (28) illustrate that in such a case, the liquid is not deleted. 
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(28) No liquid deletion in monosyllabic nouns 
a. Kéi + ple —  kló-plé (*ko-ple ) 


country small ‘small country’ 


b. Ve + ple  — Keplé 


post small ‘small post’ 

c. kró5 + ple —  kró-plé 
rubber small ‘small rubber’ 

d. plo + plè  —  plo-plè-?élé? 
plants small ‘small plants-cM4' 


Liquid deletion takes place in words with a non-high vowel. A liquid preceded 
by a high vowel, [i] or [u], as in (29), cannot be deleted. 


(29) No liquid deletion before a high vowel 


a. frü-?ù  frü:-rù b. Hnän klù-rù —— c. flizi "mä 
mat-CM5 mat-CMÓ Way-CM5 way-CM6 spot-CM5 spot-CM6 
‘mat’ ‘mats’ ‘way? ‘ways’ ‘spot’ ‘spots’ 


Liquid deletion does not happen in verbs. In all examples, liquid deletion takes 
place in the first element of a compound. The second compound does not lose its 


liquid. 


(30) No liquid deletion in the second element of compounds 


a. pid +pl — pi3.pl5-?5 *(pi3.p5-?75) 
child bamboo 
"child e bamboo’ 
b. tō + kro — to.kr3-?5 
father car 
‘father’s car’ 


4 Merging process in Fró?ó 


Merging (or portmanteau building) denotes the process of making one word out 
of two. In Fró?6, two independent monosyllabic function morphemes are merged 


"The last noun (28d) plo does not have a singular equivalent. 
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into a single syllable, for instance two pronouns, or an auxiliary and a pronoun. 
The pronouns may have different functions in the sentence: subject and one or 
two objects for instance. There are two ways of achieving merging in Fró?6, the 
first of which is fusion, resulting in a CV syllable, and the second of which is 
resyllabification with vowel deletion and consequent coda creation (CVC), of the 
same type described in $3.1 


4.1 Merging as fusion 


In the fusion process, only one syllable is formed out of two: the onset of the 
first word is preserved and the resulting vowel has features from all remaining 
segments. First examples appear in (31). In these fused morphemes, two mono- 
syllabic pronouns on the left are pronounced in one syllable, as shown to the 
right of the arrows. In all cases, the onset of the first pronoun is kept: these are, 
respectively, [k], [t], [1], [m], [w] and [p]. The fused segment is always [u] and 
is the result of fusing [i] from the first pronoun and [wí] from the second pro- 
noun. In all examples, all words have a high tone, and unsurprisingly the fused 
morpheme also has a high tone. 


(31) Fusion of two pronouns 


a. kí wi nä — kú pà 
PRO1.3SG PRO3.35G see 
‘it has seen him’ 

b. ti wi pa — tú på 
PROI.3PL PRO3.35G see 
‘they have seen him’ 

c. lí wi nä — lina 
PRO2.3SG PRO3.35G see 
‘it has seen him’ 

d. mi wi pa — mina 
PRO.1SG PRO3.35G see 
‘T have seen him’ 

e. wi wi pa — wana 
PRO3.3SG PRO3.35G see 
‘he has seen him’ 

f. pi wi pa — puna 
PRO4.3SG PRO3.35G see 
‘they have seen him’ 
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The features involved in the fusion process are illustrated in (32) and (33). The 
features of the resulting vowel are a compromise between the features of the 
original vowels and [w], the initial consonant of the second pronoun. Since both 
vowels are [+high], this feature is kept. The feature [labial] of [w] is translated 
into vocalic [round] and [round] in Fró?ó only appears on back vowels. For this 
reason, [i] becomes [u] in the fused version. 


(32) ki wi 
ku 


(33) Features of the fused segments [i], [w] and [i] 


[i] [i] 

: [w] : 
+high : +high 
| -back | | labial | -back | 
[u] 

«round 
| +high | 


However, the fused vowel is not always [u]. In (34), it is [5], the result of fusing 
[&] and [i]. Notice that [o] cannot be nasalized and [5] is used instead. 


(34) má wi pa — mona 
PRO.2SG PRO3.3SG see 
‘you have seen him’ 


In this case, a compromise is needed between [low] from [4] and [+high] from 
[i]. The result is the mid back vowel [5], a round nasal segment by default. Nasal- 
ity of the first word is preserved, and height is a compromise. As before [round] 
is derived from the labiality of the glide [w]. 
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(35) Features of the fused segments [a], [w] and [i] 


[a] [i] 
[w] d 
| | +high 
| i | [ labial ] P | 


nasal 


[3] 
round 
-high 


nasal 


Further examples of fused morphemes appear in (36). 


(36) a. pé wi nä — pó pà 
PRO3.3PL PRO3.35G see 
‘they have seen him’ 
b. ké wi pa — kó na 
PRO3.3SG PRO3.3SG see 
‘they have seen him’ 


In (37), two monosyllabic pronouns pé ‘they’ and wi ‘him, her’ are fused. The 
fusion segments are [é] from the first pronoun and [wi] the second pronoun and 
the result of the fusion is [6], again a back vowel in order to keep the feature 


[labial] from [w]. 


(37) Features of the fused segments [é], [w] and [i] 


[6] [í] 

: [w] . 
-high . +high 
| -back | | DM | -back | 
[6] 


4.2 Merging as resyllabification with vowel deletion 


Fusion of the kind illustrated in §4.1 is not always a possible outcome. In the 
following examples, it is still the case that two monosyllabic function words are 
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fused into one, and the result is thus a portmanteau word in each case, but the 
process is different: the onset of the second morpheme becomes the coda of the 
syllable made by the first morpheme. The examples in (38) show how the lateral 
[1] of the object pronoun lí becomes the coda of the syllable of the first pronoun. 
A similar process of vowel deletion was already illustrated in §3.1 An important 
difference is now that the deleted vowel does not need to be identical to the 
preceding one, see (38f). 


(38) Fusion of two pronouns leading to a syllable with coda: CV + CV — CVC 

a. ki lí pa — kil pa 
PRO1.3SG PRO3.35G see 
‘it has seen hit’ 

b. ti lí pa — til pà 
PRO1.3PL PRO3.35G see 
‘they have seen hit’ 

c. li lí pa — lil på 
PRO2.3SG PRO3.35G see 
‘it has seen hit’ 

d. wi li pa — wil pa 
PRO3.3SG PRO3.35G see 
‘he has seen him’ 

e. ke lí pa — kél pa 
PRO2.3PL PRO3.35G see 
'they have seen him' 

f. pé lí nä — pél pà 
PRO3.3PL PRO3.35G see 


'they have seen him' 


5 Loanwords 


Loanwords are a good place to verify the generalizations obtained for constrain- 
ing the syllable structure of a language. They validate syllable structural pro- 
cesses in Frò?ò that have been discussed in the previous sections of this article. 
Loanword phonologies have received much attention in the last decades, see 
for instance Itó & Mester (1995), Peperkamp & Dupoux (2003) and others. They 
shape the phonological structure of new words entering the language and impose 
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native phonotactic and segmental constraints on them. Speakers of a language 
confronted with the task of articulating foreign words adjust the phonological 
shape of these words to render them more native. This implies that the segments 
of foreign words need to be adapted to the native phonemic inventory in case 
they are not there from the start. Moreover, and this is what we are primarily 
interested in here, the constraints on the syllable structure of the borrowing lan- 
guage needs to be respected. Silverman (1992) made a distinction between two 
adaptation tasks: at the “perceptual level”, the acoustic input, or the acoustic im- 
age that the speaker has of the foreign word, is interpreted as a string of native 
segments. At the “operative level”, all segments are assigned licit positions in the 
syllables. However, the two levels are not always clearly separated from each 
other. An illicit consonant in the coda for instance can be accommodated in (at 
least) two ways: either it is replaced by a licit consonant that may happen to be 
similar to the original one. An illicit consonant in the coda for instance can be 
accommodated. For example, it could be replaced by a licit consonant that may 
happen to be similar to the original one, the syllable structure could be repaired 
by adding a vowel so that the illicit coda is adapted as the onset of a new sylla- 
ble, or the coda could be deleted altogether. Consider first the examples in (39) 
showing how French words starting with [r] are adapted in Fro?o. 


(39) Loanwords: French — Fró?ó 
a. i. radio [radio] — [à.ra.jíó] ‘radio’ 
ii. rateau [Eato] — [à.rà.to]/[hrà.to] ‘rake’ 
b. i. robot [robo] — [hó.ró.bó]/[hro.bo] ‘robot’ 
ii. regarder [sogasde] — [hé.ré.ga.dé]/[hré.ga.de] ‘to look’ 
iii Rémis [remi] — [hé.ré.mi ]/[hré.mi] ‘name’ 
c. i. route [rut] — [hrü.ti] ‘road’ 

When the word starts with [x] in French, its equivalent [r] is disallowed word- 
initially in Fró?ó. Three repair options are available, as shown in (40) and (41). 
If [a] is the first vowel of the first syllable, this vowel is copied and added word- 
initially, see (40a). If the vowel is not [a], the vowel is also copied and added 


word-initially, but in this case, [h] is also added, before the vowel, as in (40b). 
Alternatively, only [hr] is realized, the vowel is not copied, see (40c). 


(40) Repair processes for words with a word-initial [r] 
a. Ea — a.ra (ara jíó) 
b. rV — hV.rV (hó.ró.bó) 
c. rV — hrV (hrü.ti) 
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Word-internal coda [x] is repaired in at least two ways, as shown in (41). It can 
elide, like in the words in (41a). Alternatively, it becomes the onset of a syllable 
containing an epenthetic vowel identical to the preceding one, see in (41b). 


(41) Word-internal coda [r] repairs 
a. [x] is deleted word-finally: 
i corde [kokd(s)] — [k$:.di] ‘rope’ 
ii porte [post(a)] — [p3:.ti] ‘door’ 
iii. écorce [ekoxs(a)] — [hé.k$:.si] ‘peel’ 
b. Word-internal vowel epenthesis: [5] becomes an onset: 
i carcasse [kaskas] — [kà.rà.ká.si] ‘carcass’ 


ii. calme [kalm(a)] — [ká.lá.mà] ‘calm’ 


It was shown above that word final codas with a low sonority are not allowed 
in Fró?ó. In adapting words from French containing such codas, an epenthetic 
vowel is added word finally, as shown in (42). The epenthetic vowel is a high 
vowel alternating between [i] and [u]. [i] is preferred after front vowels and [u] 
after back vowels. 


(42) Epenthesis of [i] or [u] 
a. i Yvette [ivet] — [hí.véti] ‘name’ 
ii. Frák [feák] — [frá.ki] ‘name’ 
ii. hirondelle [ixodel] — [hi.r3.dé.li] ‘swallow’ 
iv. aigle [egl(o)] — [hé.gli] ‘eagle’ 
v. maitre [mets(a)] — [métri] teacher 
b. i. robe [rob] — [h$.r5.bù] ‘dress’ 


ii. rose [soz] — [hó.ró.(z)sà] ‘rose’ 


In sum, the syllable restrictions shown in 82 lead to a tendency of transforming 
vowel initial loanwords from French into Fró?ó words by adding an initial [h]. 
An initial [r] is repaired in diferent ways. Moreover, syllables with a disallowed 
coda in Fró?ó are adapted into the language with an epenthetic final vowel. 

Loanwords are also frequently adapted from Bambara, a neighboring Mand- 
ing language.? Two nouns from Bambara can combine and lead to a single word 


*The Manding people come from West Africa. They are known by different names such as 
Bambaras in Mali, Dioulas in Cóte d'Ivoire and Burkina Faso, and Malinkés in Guinea, Senegal 
and Gambia. 
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in Fró?6, see examples in (43). The monomorphemic words of Bambara are rein- 
terpreted in Fró?ó as consisting of a lexical root plus a CM. When they combine 
to form a compound, the second syllable of each word is deleted, and a CM5 is 
added. The onset of the CM5 is an epenthetic glottal stop. 


(43) Bambara — Praia 
a. né.né + dà.gá — nè.dá-?á 
‘iron’ ‘pot’ iron pot-CM5 
‘iron pot’ 
b. kügü +tigi ^ — ki.di-?i 
‘head’ ‘owner’  person.in.charge-CM5 
‘person in charge’ 


In (44a) the second syllable of the Bambara noun has a disallowed coda which 
is deleted in the adaptation process. The example (44b) shows vowel deletion in 
the first syllable: a trisyllabic noun becomes disyllabic and the second syllable is 
now a CM. It is class 1 because it is a loanword, and loanwords often are of class 
1. 


(44) a. baran.da — ba.ral 
‘banana’ banana 


b. sara.ka — sra.?a 
‘aumône’ aumóne.cM1 


Other loanwords from Bambara are not modified in Fró?ó. 


(45)  sé.bé (46) bè.sé 
‘book’ ‘machete’ 


As to the question of the phonological theory of loanword adaptations vs. re- 
pairs as phonetically-based perceptual adaptations, it may well be the case they 
are both justified in their assumptions, but that they apply to different cases. The 
perceptual analysis (Peperkamp & Dupoux 2003) makes sense for speakers who 
are largely monolingual and do not have any plasticity in the use of several lan- 
guages. Its defenders take for instance allophony between [r] and [1] in Japanese 
and Korean to be an explanation of loanword adaptation in terms of phonetic 
deafness, leading speakers to recode nonnative sounds as native ones during per- 
ception. This allophony is part of the core native phonology of Japanese (Itó & 
Mester 1995). The phonological perspective assumes that the phonological forms 
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of loanwords are computed by the phonological grammar of the borrowing lan- 
guage, leading to massive repairs and restructuring. 

However, Fró?ó speakers are confronted with a multitude of languages on 
a daily basis. They are surrounded by other Tagbana dialects, Manding, Baolé, 
Nouchi, French in at least two varieties (standard and popular) and often more, 
all languages with different phonological systems. It can be plausibly assumed 
that their perception is finely tuned. The perceptual view makes the implausible 
assumption that the speakers literally hear an entire syllable hé at the beginning 
of regarder. Rather, pronouncing this word in Fró?ó forces speakers to adapt to 
the phonotactic pattern of their own language. 

A final question that is too often ignored is how we measure similarity be- 
tween the original word in the source language and the loanword. In Fró?6, 
the number of syllables only plays a subordinate role (we saw that deletion of 
segments is a frequent process, changing the number of syllables), and the seg- 
mental configuration of the syllables is also very important. This paper has only 
scratched the surface of what is to be discovered in Frò?ò. 


6 Conclusion 


This short paper investigated the intricate syllable structure conditions found in 
Fró?6, a dialect of Tagbana. First, the phonotactic conditions regulating the very 
simple underlying syllable structure were examined. Two syllable structures are 
allowed in Fró?6: syllables containing only a vowel or a nasal (nucleus-only syl- 
lables), and syllables with an onset (CV or CCV syllables). Word-initially, [a] is 
the only vowel that can appear in a V syllable, all other vowels need an onset 
consonant. All consonants can appear word-initially except for [r] and [?]. We 
suspect that [r] is associated to two syllabic positions and thus needs a preced- 
ing segment to satisfy its double association, see Fanselow et al. (In preparation) 
for detail. As for the glottal stop, it is an epenthetic consonant inserted between 
two identical vowels. In a second step, the conditions of resyllabification were 
discussed. Here we showed that three processes are active that cause both more 
complex and simplified syllables: first, vowel deletion that engenders syllables 
with a coda, second vowel deletion that engenders syllables with a complex on- 
set, and third, liquid deletion leading to simpler onsets. Next, the process of merg- 
ing two monosyllabic function words was the subject of 84. It was shown there 
that if the onset of the second function words is the glide [w], fusion can take 
place. Fusion is the process of making only one syllable out of two monosyllabic 
function words. The initial consonant of the first word is left unchanged, and the 
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resulting vowel is a compromise between the features of the remaining segments. 
If fusion is not possible because of incompatible features, the vowel of the second 
word is deleted and the result of merging is a syllable with a coda. In the last sec- 
tion, loanwords were addressed. It was shown there that the syllable structure 
of loanwords is adapted to the syllable’s constraints that have been described in 
the preceding sections of the paper. 


Abbreviations 

AUX Auxiliary 

CM Class Marker 

OBJ Object 

PL Plural 

PRO Pronoun 

RELPRO Relative Pronoun 
SG Singular 
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Chapter 2 


Variable word-final vowel deletion and 
reduction in Gurmancema: A maximum 
entropy model 


Maggie Baird 
Dartmouth College 


Gurmancema (Gur, Burkina Faso) displays an overall dispreference for word-final 
tense vowels phrase-medially. On the surface, there is vowel reduction and vowel 
deletion, which vary both across and within phonological contexts. This work will 
provide an overview of the complex data patterns and describe a weighted con- 
straint approach to the data patterns using a Maximum Entropy Harmonic Gram- 
mar. Weighted constraints are preferred to ranked constraints due to variability in 
the data and to account for cases of constraint ganging, including superadditivity. 


1 Introduction 


Gurmancema (ISO 639-3: gux), also known as Gulmancema, Gourmanchema, or 
Gourmanché, is a Gur language spoken in Burkina Faso, Togo, Benin, and Niger. 
It has approximately 1 million speakers across this region (Simons & Fennig 2017). 
Primarily a spoken language, there is much dialectal variation across speakers in 
different villages and areas. Gurmancema is an SVO language with three tones 
(H, M, L). 

Gurmancema has a five vowel system /a, e, i, o, u/. In this paper, I take these five 
phonemic vowels to be [+tense] and the allophone [ə] to be [-tense].! /a:, iz u:/ 
are phonemic in Gurmancema and contrast with /a, i, u/. There are conflicting 


'The feature [tense] is used for convenience to distinguish peripheral vowels from the central 
vowel [»] and is not meant to make theoretical claims about the [tense] status of these vowels. 
The markedness constraints InENT[tense] has a weight of 0 in the model and does not change 
the analysis. 
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accounts of whether the long vowels /e:, o:/ are also phonemic.” There is also 
debate over which diphthongs are phonemic in the language. My analysis only 
makes reference to short, word-final monophthongs, so I remain agnostic. 

Table 1 shows the consonant inventory of Gurmancema for the dialect of study 
in this paper. See §4 for more information about the dialect in question. 


Table 1: Consonant Inventory of Gurmancema 


Labial Alveolar Palatal Velar  Labiolvelar Glottal 


Stop p, b t,d GI kg kp, gb 
Fricative f S h 
Nasal m n n I jm 

Liquid l 

Tap f 

Glide w j 


§2 of this paper describes the complex data patterns in question in this study. 
In §3, I will discuss previous work in Gurmancema and on similar patterns of 
vowel deletion. §4 describes the data collection and analysis process. §5 presents 
a constraint-based analysis of a few illustrative phonological environments. In 
§6, I will summarize my findings. 


2 Data presentation 


In Gurmancema, word-final vowels in verbs and nouns are often deleted or re- 
duced to [ə] in phrase-medial position. The surface representation of these vow- 
els depends on many factors, including the sound that follows them. For example, 
in cases of vowel hiatus, the word-final vowel of the first word is always deleted, 
shown in (1): 


(1) /súnd-i à=bób=à/ [súnd àbóbà] 
forget-ASP CL.PL=jug=CL.PL 
‘forgets jugs’ 
The behavior of the word-final vowel also depends on the sound that precedes 


it. For example, the vowels /a, i, o/ all reduce to [ə] when preceded by an obstru- 
ent, shown in (2): 


"For grammars containing phoneme inventories and justifications, see Chantoux (1968) and 
Naba (1994). 
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(2) a. /ki=bé=ga pwá/  [kibégo pwá] 

cr.sc-child-cr.sc hit 
'child hits' 

b. /ti-kpén-di pwá/  [tikpend> pwá] 
CL.SG=sauce=CL.sG hit 
‘sauce hits’ 

c. /o=numb=0 pwá/ | [onumba pwá] 
CL.sG-donkey-cr.sc hit 
'donkey hits' 


As can be seen, many factors come together to influence the surface represen- 
tation of underlying vowels, not just what sound precedes or follows the vowel. 
Different vowels behave differently in the same context. /i/ reduces to [ə] when 
preceded by an approximant (3a), but /a/ will surface as [a] in the same immedi- 
ate phonological environment (3b). 


(3) a. /da=li pwá/ [dalà pwá] 
friend-cr.sc hit 
‘friend hits’ 
b. /a=bé=la pwa/  [abéla pwá] 
CL.PL-child-cr.Pr hit 
‘children hit’ 


To complicate matters, certain patterns in Gurmancema are categorical, while 
others are variable. In (2), I showed that /a, i, oi categorically reduce to [ə] when 
preceded by obstruents, but the vowel /u/ shows a variable pattern. The percent- 
ages and token counts are from the sample included in this paper, which consists 
of data elicited from one speaker. 


(4) a. /bü-tí-bu kpá/  [bütibu kpá] (67%, n = 3) 
CL.SG-tree-cr.s kill [butiba kpa] (33%, n = 3) 
‘tree kills’ 
b. /ká-dá-gu kpá/ ^ [küdágu kpá] (67%, n = 3) 
CL.SG-wood-cr.sa kill [kádágo kpa] (33%, n = 3) 
‘wood kills’ 


As these examples show, this is a case of token variation, where the same word 
can be pronounced differently on different occasions with no change in meaning. 
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Word-final vowels preceded by the bilabial nasal also show a variable pattern, 
which is not only predictable by the CV combination, but also takes into account 
the first consonant of the next word. When the next word begins with a bilabial 
consonant, the vowel varies between reduction and deletion, but when the con- 
sonant has a different place of articulation, the vowel is much more likely to 
reduce. 


(5) a. /mí-je-má pwá/ [mizem pwá] (67%, n = 6) 
CL.SG-food-cr.sc hit [mizema pwá] (33%, n = 6) 

‘food hits’ 
b. /mí-je-má kpá/ [mizema kp] (88%, n = 32) 
CL.sG-food-cr.sa kill [mizem kp] (12%, n = 32) 

‘food kills’ 


(6) lays out the general data patterns presented above: 


(6) V+#+V deletion of the first vowel 
Obstruent + /a, i, o/ +#+C reduction of the vowel 
Obstruent + /u/ + # + C variation between reduction and deletion 
Non-nasal sonorant + /i/ + # + C reduction of the vowel 
Non-nasal sonorant + /a/ + # + C no reduction or deletion 
/m/ + /a, i, o/ + # + Non-Bilabial C reduction of the vowel 


/m/ + /a, i, o/ + # + BilabialC variation between deletion and reduction 


There are several environments which are not included in this analysis. /e/- 
final words are morphologically distinct and involve a different set of constraints 
than is presented below. Other environments, such as non-nasal sonorant + /o, 
u/, and /m/ + /u/ are missing due to a lack of data. 

In this paper, I will walk through a constraint-based analysis that models all of 
these patterns and also accounts for the variation. Vowel reduction and deletion 
have been noted in the literature on Gurmancema, but never fully analyzed in- 
cluding variation. The model incorporates 13 constraints and necessitates supper- 
additivity (Albright et al. 2008; Green & Davis 2014) to account for the patterns 
in the most economical way. 


3 Previous work 


In the literature on Gurmancema, there is not a complete account of vowel dele- 
tion and variation. Naba (1994) notes that three authors have touched on issues 
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of reduced vowels from a phonotactic perspective, but they all have different 
analyses. He says that too little attention has been paid to elision, apocope, and 
assimilation and that the description and analysis of these phenomena are very 
important for tonal and morphological reasons. 

The only work focusing specifically on vowel deletion and reduction is Rial- 
land (1980; 2001). She labels the process “vowel punctuation and deletion”, argu- 
ing that there are weak and strong forms of words. Strong forms occur at the ends 
of phrases, where word-final vowels are maintained. In weak position, or phrase- 
medially, she argues that vowels are deleted and an epenthetic [i] is inserted to 
prevent illicit consonant clusters. She argues that these processes do not affect 
monosyllabic words. In Rialland (2001), she elaborates that monosyllabic words 
are slightly lengthened. Her paper does not attempt a constraint-based analysis 
- I will argue that these frameworks are better suited to representing the data 
than rule-based phonology. 

Across Niger-Congo languages, vowel elision is also common as a repair for 
vowel hiatus. Across word boundaries, the first vowel is always deleted (Casali 
1997), which holds true for Gurmancema. Although this is one aspect of the 
vowel-deletion process in Gurmancema, vowels do not only delete in hiatus con- 
texts in Gurmancema, as can be seen in the examples in (5). There is no process 
analagous to the word-final vowel reduction in Gurmancema that I could find in 
the Niger-Congo literature. 


4 Data collection and analysis 


All of the data in this paper are based on the speech of a 37-year-old female 
speaker from the village of Mahadaga, currently living in France. All data were 
elicited in summer 2016. I designed an elicitation plan with the aim of obtain- 
ing every possible CV#C combination in the language across Subject-Verb and 
Verb-Object boundaries. I also elicited data in other syntactic conditions, such as 
relative clauses and Adjective-Noun pairings, but this paper will focus solely on 
the Subject-Verb and Verb-Object interfaces. There were restrictions on phone- 
mic distribution, however, as Gurmancema has multiple noun classes, marked 
by proclitics and enclitics. Nouns only begin and end with certain sounds, with a 
few exceptions. I created an elicitation plan that included 26 nouns used as sub- 
jects, with 25 different CV endings. I elicited 20 transitive verbs and 4 intransitive 
verbs with each of the subjects. Verbs had to vary with both the initial consonant 
and the final CV, and I attempted to include as many environments as possible 


at both edges. 
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This study aims to provide a phonological analysis grounded in acoustic pho- 
netics rather than perception. The informing data were not field notes, but rather 
the database of acoustic measurements. I tagged every word-final vowel (n = 
3855) in my Gurmancema sample for important values: the vowel (a, e, i, 0, u), 
the preceding phoneme, the following phoneme, the role in the sentence (S, V, 
O), the surface form (maintained, reduced, deleted), whether it was followed by 
a pause, the length of the vowel in seconds (3 decimal points), and F1, F2, and F3. 
All phonetic measurements were made in Praat (Boersma & Weenick 1996) by 
hand. Given the large number of attributes I was tracking, I did not find a script 
that I thought would be more efficient than taking measurements by hand. I took 
formant measurements at the middle of the vowel in the most stable portion of 
the formants. There is a possibility for human error and measurement variation, 
but taking formant measurements by hand allowed me to evaluate any problem- 
atic data points individually. Formant measurements were taken for a total of 
3855 vowels, including diphthongs (n = 200) and phrase-final vowels (n = 1297), 
which were not included in the final analysis. Phrase-final vowels were not in- 
cluded as the reduction and deletion processes do not apply at the end of phrases. 
I also did not analyze any pre-pausal data from slow speech. The goal of this 
study was to model naturalistic speech and not have any confounding effects 
from pauses. 

I used formant measurements to determine whether a vowel was maintained 
or reduced. I took a sample of canonically maintained vowels and took mutually 
exclusive ranges in F1 and F2 to determine whether the more ambiguous cases 
were maintained or reduced. The results of my phonetic analysis show that there 
is one central reduced vowel [ə], which can be seen in Figure 1. Although schwa 
does seem to overlap with the vowels /o/ and /u/, none of the cases where [a] is 
that far back and high are underlyingly /o/ and /u/. These are cases of reduced 
/i/ and /a/. 

Throughout the analysis, the term “prevocalic consonant” refers to the con- 
sonant that is in penultimate position of the given word (i.e. the onset of the 
final syllable). The term "following consonant" refers to the first consonant of 
the following word. 

For example, in the phrase /kibéga pwá/ ‘child hits’, the prevocalic consonant 
is /g/ and the following consonant is ‘p’. 

In general, there are three surface forms for vowels. If a vowel surfaces the 
same in the output and the input, I will refer to it as “maintained” If a vowel 
surfaces as [a], I will refer to it as “reduced.” If a vowel is deleted, I refer to it as 


“deleted.” 
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Figure 1: Plot of maintained vowels and schwa in Gurmancema. Note: 
Ellipses mark standard deviations from the mean formant values of 
each vowel. 


5 Constraint-based analysis 


In this section, I will explain the model I used to analyze the data. 


5.1 Maximum Entropy Harmonic Grammar 


Due to the complex and variable nature of the process, these data are well-suited 
to a constraint-based analysis. A major component of the complexity in the Gur- 
mancema data is the free variation, which necessitates a stochastic constraint- 
based model. In this paper, I use Maximum Entropy Harmonic Grammar (Max- 
Ent), which captures patterns of variation (Goldwater & Johnson 2003; Hayes & 
Wilson 2008). 

In MaxEnt, each candidate receives as score, which is calculated the same way 
as Harmony in traditional Harmonic Grammar (HG), by summing the weights 
of violations incurred. The calculation of the score h(x) is as follows: 


(D h(x) = DN, wiCi(x) 
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wj is the weight of the ith constraint, C;(x) is the number of times x violates 
the ith constraint, and the sum is the summation over all constraints in the model. 

The MaxEnt grammar tool (Wilson et al. 2008) optimizes all of the constraints 
and assigns each of them weights that can most closely match the observed prob- 
abilities of each candidate. Essentially, the probability of any given candidate is 
calculated from its score in comparison to the scores of all the candidates. For 
example, if each of two candidates have a score of 1, they will have a probability 
each of 0.5, so they will each be selected 50% of the time by speakers as they are 
equally well-formed. 

The probability P(x) is calculated as follows based on all the possible forms in 
Q: 


(8) P(x)= ope) where Z = dyer exp(-h(y)) 


The base of the natural logarithm e is raised to the negative of the score, and 
the probability is calculated over all possible forms. 


5.2 Fitting the model 


I split my data into unique CV#C combinations, and entered each with its ob- 
served probabilities into the learner. The model input consists of a set of con- 
straints, an underlying form, and several (usually 2-6) plausible output candi- 
dates, each with observed probability computed from my phonetic analysis. In 
total, I entered 364 unique tableaux into the MaxEnt Grammar Tool, with stan- 
dard settings. The model generates weights for each of the constraints that yield 
the probabilities closest to the observed probabilities. In total, the model required 
13 constraints to model the patterns observed in the data. 


5.3 Initial constraints 


In this section, I will present the initial constraints that are the minimum set to 
account for the simplest data patterns. As I go through the different patterns in 
more detail, I will describe and add any other necessary constraints. A full list of 
constraints and their final weights can be found in §5.8. 

Other models for reduction and deletion often attribute vowel reduction and 
deletion to prosodic motivators. In Gurmancema, it is the word-final position 
that is targeted for reduction and deletion regardless of tone or stress (which 
does not appear to be a salient prosodic feature of the language). For this reason, 
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the constraint I am using to explain this pattern is simply a markedness con- 
straint targeting word-final tense vowels in the language. All data in this paper 
are phrase-medial, so each constraint is assumed to be phrase-bounded. 

The overall dispreference for word-final tense vowels is captured by the mar- 
kedness constraint NOFINALTENSEVOWEL: 


(9) NoFiNALTENSEVOWEL (*[+tns]#): Assign a violation for every word-final 
tense vowel 


The repairs triggered to satisfy *[+tns]#, either by reducing a word-final vowel 
or deleting it, come with faithfulness violations. By reducing a vowel from any of 
the phonemic vowels to [a], the vowel changes the feature [+tense] to [-tense]. 
This change violates the IDENT(tense) constraint, which penalizes changes in the 
feature [tense] from the input to the output. In the final model, however, this 
constraint did not ever influence a winning candidate, so it is omitted from the 
rest of the paper. 

Reducing a vowel changes other features, so reduction is penalized through 
more specific constraints, described in (10). 


(10) a. IDENT-IO(Low) (IDENT(lo)): Assign a violation for every segment that 
does not surface with the same value for [low] in the input and the 
output. 

b. IDENT-IO [LABIAL] (IDENT(lab)): Assign a violation for every segment 
that does not surface with the same value for [labial] in the input and 
output. 

c. IDENT-IO [Hicu] (IDENT(hi)): Assign a violation for every segment 
that does not surface with the same value for [high] in the input and 
the output. 


IDENT(lo) penalizes /a/ when it raises to [a], but does not target any other 
underlying vowels. IDENT(hi) penalizes /i/ and /u/ when they lower to [a], and 
IDENT(lab) penalizes /u/ and /o/ when they reduce. As described in (2), these 
vowels behave differently in the same surrounding contexts, which is why three 
different IDENT constraints are necessary rather than the general IDENT(tense) 
constraint. 

The faithfulness that penalizes vowel deletion is MAx(V), described below: 


(11) MaxiMIZE(VOWEL) (MAx(V)): Assign a violation for every vowel in the 
input that does not surface in the output. 


Constraints that interact with subsets of the data more specifically will be 
presented in the following sections. 
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5.4 Vowel hiatus resolution 


Gurmancema has a vowel hiatus resolution process similar to other Niger-Congo 
languages (Casali 1997). The markedness constraint that motivates the hiatus 
resolution is as follows: 


(12) (VV): Assign a violation for every sequence of vowels across a syllable 
boundary. 


Vowel hiatus in Gurmancema is always resolved by deleting the first vowel, 
or the word-final vowel. A tableau showing an example is shown in Table 2. 


Table 2: Tableau for /di#a/ with sample phrase ‘forgets jugs’, n = 10 


p "VV *[+tns]# Max (V) IDENT (hi) 
/sándi àbóbà/ obs pred score 9.34 6.14 3.62 2.99 
a. [sándi àbóbà] 0  -0 1544 1 1 0 0 
b. [sándo àbóbà] 0  -0 1263 1 0 0 1 
c. [sánd àbóbà] MEC 362 0 0 1 0 


In the tableau in Table 2, candidate (a) has a vowel sequence and a word-final 
tense vowel, so it violates "V.V and [+tns]#, incurring a score of 9.34+6.14, totaling 
15.44. Candidate (b) violates "VV and IDENT(hi), incurring a score of 9.34 + 2.99, 
totaling 12.63. Candidate (c) only violates Max(V), incurring a score of 3.62. This 
is the lowest score by a significant amount, so the model predicts candidate (c) 
will surface approximately 100% of the time, which is what was observed. Vowel 
hiatus resolution is entirely regular and predictable in Gurmancema, with the 
word-final vowel always being deleted. 


5.5 Prevocalic obstruents 


Consonant clusters beginning with obstruents are highly marked in Gurman- 
cema, leading to a preference for reduction over deletion when word-final vowels 
are preceded by an obstruent. In other words, word-final vowels are dispreferred, 
but when deletion would lead to an illicit consonant cluster, reduction is the pre- 
ferred fix. The vowels /a, i, o/ reduce when preceded by an obstruent over 90% of 
the time. The vowel /e/ is not addressed in this paper, as in the dataset it is always 
part of root material, which behaves differently from affixal/clitic material. The 
vowel /u/, however, shows variation when preceded by an obstruent. To start to 
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account for this data, a new constraint is needed, penalizing consonant clusters 
beginning with obstruents. This is not a general *CC constraint, because not all 
consonant clusters are equally marked in Gurmancema (see 2). 


(13) *[-sonorant][+consonantal] (TC): Assign a violation for every 
[-sonorant][+consonantal] consonant cluster in the output. 


Table 3 shows an example of reduction of a vowel following an obstruent. The 
tableau shows observed data for all /Ta#C/, meaning all environments of an ob- 
struent followed by /a/ followed by a consonant. 


Table 3: Tableau for /Ta#C/ with sample phrase ‘child loses’, n = 88 


p “[+tns]# “TC Max(V) IDENT (lo) 
/kibéga biani/ obs pred score 6.14 4.55 3.62 3.52 
a. [kibéga biáni] 0.05 ~0.07 6.14 1 0 0 0 
b. [kibégo biáni] 0.91  -0.92 3.52 0 0 0 1 
c. [kibég biáni] 0.04 ~0.01 8.17 0 1 1 0 


Word-final /i/ and /o/ behave very similarly to word-final /a/ when preceded 
by an obstruent, but /u/ behaves slightly differently. Reducing /u/ violates both 
IDENT(hi) as well as IDENT(round), so the reduced candidate has more penalties 
than the other word-final vowels. This leads to a case of ganging, where the vio- 
lation of two lower constraints is strong enough to overpower one stronger con- 
straint, leading to variation. While violating either IDENT(high) or IDENT(labial) 
produces patterns similar to those seen in Table 2, violating both means almost 
double the penalty for that candidate, making reduction a significantly worse 
choice for /u/ than the other vowels. The ganging and variation is illustrated in 
Table 4. 


Table 4: Tableau for /Tu#C/ with sample phrase ‘tree wants’, n = 141 


p *[+tns]}# *TC MaAx(V)  IDENT(lab)  IDENT(hi) 
/bütibu bwa/ obs pred score 6.14 4.55 3.62 3.62 2.99 
a. [bútíbu bwà] 0.69 ~0.57 6.14 1 0 0 0 0 
b. [bútíbə bwa] 0.31 ~0.35 6.61 0 0 0 1 1 
c.[bütíbbwà] 0 ~0.08 817 0 1 1 0 0 
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The model is very accurate in predicting the distribution for maintenance 
(a) and reduction (b), but the predictions are slightly off, leading to an over- 
prediction of deletion (c), which was not observed. This is a result of the model 
being more general. The goal of this analysis was to account for the data with 
the simplest set of phonological constraints rather than overfitting the model by 
tailoring constraints to small subsets of data. "TC does not have a higher weight 
due to the few exceptional cases where TC consonant clusters occur as a result 
of deletion. MAx(V), the only other constraint violated by (c), does not have a 
higher weight due to cases of vowel deletion elsewhere in the data. 


5.6 Prevocalic approximants 


Prevocalic approximants behave similarly to prevocalic obstruents overall due to 
a similar markedness constraint on consonant clusters beginning with approxi- 
mants, parallel to (13). 


(14) *[+approximant][+consonantal] (*RC): Assign a violation for every 
consonant cluster beginning with an approximant. 


The reason we need two constraints "TC and "RC rather than a general "CC 
constraint is because consonant clusters beginning with nasals are permissible 
in Gurmancema. This markedness constraint leads to reduction, as with the con- 
straint "TC. 

There were no cases of word-final /e/, /u/, or /o/ preceded by an approximant. 
Word-final /a/ preceded by approximants behaves differently from /i/, however, 
always surfacing as a maintained [a]. The vowel /a/ reduces in other cases, such 
as after obstruents (see Table 2), so this is not a result of constraints preventing 
/a/ from reducing. There is nothing morphologically unique about the words 
that exhibit this pattern, and it is very consistent. To account for this pattern, a 
new solution is necessary. The first ingredient in the solution is the following 
constraint: 


(15) *[+approximant][-tense] (*R[ə]): Assign a violation for every 
approximant in the output followed by a [-tense] vowel 


From a data perspective, it is clear that this pattern refers to both the input and 
the output, where having a reduced vowel after an approximant is marked, but 
worse when the vowel derives from /a/ as opposed to /i/. The candidate where an 
approximant is followed by /a/ and the /a/ reduces violates IDENT[low] as well 
as "R[o]. However, the violation of these two constraints is not strong enough 
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to predict the correct forms. Both would need low weights to allow reduction 
in other environments, and their combined penalty is not enough to prevent /a/ 
from reducing. Thus, *R[a] and IDENT(Lo) on their own motivate variation across 
the language in cases where there should not be any. In other words, ganging 
alone is not sufficient to make the correct predictions. 

The only way to add more weight to this combination is to use constraint con- 
junction, where a candidate incurs a violation if and only if it violates each of two 
constraints included in the conjunction. The conjoined constraint has a separate 
weight that is added to a candidate’s Harmony. It may seem that Harmonic Gram- 
mar relieves the need for conjoined constraints, but this is not always the case. 
Superadditivity, the conjoining of constraints in a weighted model, was devel- 
oped to capture cases not easily captured by simple additive constraints (Albright 
et al. 2008; Green & Davis 2014). Smolensky (2006) argues that superadditivity 
in an HG context is necessary for local violations across a specific domain. In 
the case of Gurmancema, the constraints both refer to the onset and nucleus of 
one syllable. The pattern in these data was highly specific and very regular, so 
the learner needs more specific grammatical tools to model it correctly, but with 
constraint conjunction, this is possible with ingredients already in the model. 

The addition of the conjoined constraint *R[a] & IDENT(Lo) correctly predicts 
the data distribution. Although many superadditive models only use superaddi- 
tivity for conjoining markedness constraints (Albright et al. 2008), there is also 
precedent for using faithfulness constraints in a superadditive manner (Green 
& Davis 2014). Nevertheless, there is some debate about whether markedness 
and faithfulness constraints can be conjoined (Moreton & Smolensky 2002). Con- 
joined constraints are intended only to be used for patterns in local domains. 
According to Moreton & Smolensky (2002), as the domain of markedness is the 
output while faithfulness necessarily refers to the input, it is not clear whether 
such conjunctions can be truly considered local. In this case, the only way to rep- 
resent the data was to conjoin a faithfulness and a markedness constraint (*R[ə] 
& IDENT(lo)). 

Tableaux showing the different behaviors of /a/ and /i/ preceded by /r/ are 
shown in Tables 5 and 6. 

Without the use of superadditivity, the model predicts variation in this envi- 
ronment which showed categorical behavior. An example tableau is shown in 
Table 7. 

This set of constraints still captures general patterns, but as it predicts varia- 
tion where there is none, it does not satisfactorily capture the data. 
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Table 5: Tableau for /Ri#C/ with sample phrase ‘hair covers’, n = 192 


*R[o]& 
p IDENT(lo) *[+tns]# "RC  Max(V) IDENT (hi) "R[o]* 
/tiyàri noagéni/ obs pred score 8.13 6.14 6.04 3.62 2.99 0 
a. [tiyüri néagéni] 0.05 ~0.04 6.14 0 1 0 0 0 0 
b. [tiyüro noagéni] 0.91 ~0.96 2.99 0 0 0 0 1 1 
c. [tiyür noagéni] 0.04 ~0 9.66 0 0 1 1 0 0 


Table 6: Tableau for /Ra#C/ with sample phrase ‘children sell’, n = 160 


*R[a] & 
p IpENT(lo) “[+tns#] "RBC | Max(V) IpzNT(lo) *R[ə] 
/abéla küari/ obs pred score 8.13 6.14 6.04 3.62 3.52 


a. [abéla küari] 1 70.97 6.14 0 1 0 0 
b. [abélo küari] 0 ~0 11.65 1 0 0 0 
c. [abél küari] 0 ~0.03 9.66 0 


oro 
CO ba eil Gi 


5.7 Prevocalic nasals 


When /m/ is in penultimate position, it typically triggers reduction in the final 
vowel. This is counter-intuitive, as there is no markedness constraint against 
consonant clusters beginning with nasals. Consonant clusters beginning with 
nasals are in fact common in Gurmancema, occurring both within words and 
across word boundaries. Gurmancema, however, has strong nasal place assimila- 
tion, meaning consonant clusters with different places of articulation are highly 
marked. The markedness constraint is described in (16). 


(16) “[+nasal, +aplace][-aplace] (NASALASSIMILATION): Assign a violation for 
every consonant cluster beginning with a nasal where the second 
consonant has a different place than the nasal. 


Table 7: Tableau for /Ra#C/ with sample phrase ‘children sell’, n = 160 
without superadditivity 


p "RC “*[+TNs#] IDENT (lo) “*R[a]  Max(V) 
/abéla küari/ obs pred score 9.83 9.64 7.84 3.37 3.33 
a. [abéla küari] 1 ~0.81 9.64 0 1 0 0 0 
b. [abélo káari] 0 Am 14521 8 0 1 1 0 
c. [abél küari] 0 om 1316 1 0 0 0 1 
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NASALASSIMILATION prevents the consonant clusters with mismatched place 
but IbENT(lab) prevents /m/ from assimilating to the following consonant. In the 
context of final vowel deletion, this situation leads to a strong preference for 
vowel reduction to avoid creating such a mismatched cluster. In the cases where 
the vowel after /m/ does delete, however, the nasal retains its bilabial place due 
to IpeNT(lab). This does result in cases where there is no nasal place assimilation, 
which is rare in Gurmancema, but predicted correctly with the constraints in this 
model. A tableau illustrating all the constraints for prevocalic /m/ is presented 
in Table 8. 


Table 8: Tableau for /misC/ with sample phrase ‘elephants eat’, n = 44 


NASAL 
p “[+tns]# Max(V) rpENT(lab  IDENT(hi) ASSIMILATION 
liläomi dá/ obs pred score 6.14 3.62 3.62 2.99 2.55 
a. [iliomi dá] 0.07 ~0.04 6.14 1 0 0 0 0 
b. [ilúomò dá] 0.91 ~0.91 2.99 0 0 0 1 0 
c. [iliom dá] 0.02 ~0.04 6.17 0 1 0 0 1 
d. [ilüon dá] 0 70.01 7.24 0 1 1 0 0 


In cases where the next consonant is also bilabial, the vowel is more likely 
to delete because it only violates MAx(V), rather than Max(V) as well as either 
NASALASSIMILATION or IDENT(lab). 


Table 9: Tableau for /ma#C/ with sample phrase ‘food hits’, n = 6 


NASAL 
p *[«tns]? | MAx(V) IDENT(lab) IDENT(low) ASSIMILATION 
/mizema pwá/ obs pred score 6.14 3.62 3.52 2.99 2.55 
a. [mizema pwá] 0 ~0.04 6.14 1 0 0 0 0 
b. [mirems pwá] 0.33 -0.50 3.52 0 0 0 1 0 
c. [mizem pwá] 0.67  -0.46 3.62 0 1 0 0 


While there may be greater deviation between observed and predicted proba- 
bilities in this case, the sample size for this environment was considerably smaller 
than for other environments modeled, given its specificity. 


5.8 List of constraints 


In the preceding subsections, I have presented tableaux with only the relevant 
constraints. In order to understand the complete analysis, (17) provides a full list 
of constraints and their weights, in order from highest to lowest weights. 
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(7)  *VV 9.34 
IDENT(lo) & *R[a] 8.13 
*[+tns]# 6.14 
*RC 6.04 
Dr»(C) 5.79 
*TC 4.55 
IDENT(lab) 3.62 
Max(V) 3.62 
IDENT(lo) 3.52 
IDENT(hi) 2.99 
NASALASSIMILATION 2.55 
IDENT(tns) 0 
*R[a] 0 


6 Conclusion 


This paper provides the first full account of word-final vowel reduction and dele- 
tion in Gurmancema in a constraint-based model. These data are very complex 
and have been largely left unanalyzed in the limited literature on this language. 
Unlike in other cases of vowel reduction and deletion, the patterns are not en- 
tirely motivated by stress patterns or vowel hiatus resolution. These data can 
only be represented by looking at a number of phonological factors. 

A Harmonic Grammar model was necessary to account for cases of constraint 
ganging, and a MaxEnt version of HG was used to predict the variation found 
in certain parts of the data. The model necessitated the addition of superaddi- 
tive conjoined constraints to represent highly specific yet regular patterns. These 
superadditive constraints included conjoined Faithfulness and Markedness con- 
straints as opposed to simply conjoined Markedness constraints. Although this 
goes against traditional theories of conjoined constraints, these constraints were 
motivated by the complexity of the data. It was also more natural to conjoin pre- 
existing constraints rather than invent new and highly specific constraints to 
account for the data. 

This model accounts for the purely categorical patterns in Gurmancema, such 
as vowel hiatus resolution, as well as variable patterns. This paper does not in- 
clude a full representation of all the complexity in Gurmancema, however. There 
is much room for future work in Gurmancema. In particular, the interaction of 
this process with tone and syntax remain. 
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Child speech and language disorders are not well known in many African countries. 
The aim of this study was to analyze speech sound disorders in Cameroon in order 
to encourage the rehabilitation of language disorders in general in the public health 
system. From a sample of 1127 children, 6% of children presented with speech sound 
disorders including speech delays, articulation and phonological disorders. Boys 
were more affected than girls. Complex syllables and fricatives sounds were the 
most impaired while omission and substitution were the most frequent errors. 


1 Introduction 


Language disorders are not well known in many sub-Saharan countries. Our re- 
view of the literature indicates only a few African countries such as Togo, Nige- 
ria, Kenya and South Africa, where studies have been carried out to describe 
the actual situation of these disorders (e.g. Van der Linde et al. 2016; Topouzkha- 
nian & Mijiyawa 2013). This paper aims to contribute toward this effort of better 
understanding language disorders in African countries given their considerable 
impact on primary education. Language disorders are impairments that affect 
the human language faculty and appear through one or many linguistic compo- 
nents of the spoken and written language. It is now well known that children 
with one or more of these impairments are at risk for limited social achievement. 
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Speech sound disorder (SSD) and literacy difficulties (i.e. reading and spelling 
problems) are the most frequent language disorders among children (Ruscello et 
al. 1991). Our study specifically concerns the first category which are difficulties 
in producing or using speech sounds, very often the consonants, without organic 
alteration (such as hearing loss or cleft palate). The general objective of this study 
is to analyze these disorders in Cameroon in order to encourage the rehabilita- 
tion of language disorders in the public health system. In fact, in Cameroon as in 
many Sub-Saharan countries, children are not screened neither referred for as- 
sessment of speech and language disorders during their third or fourth year as it 
is usual in Nord American countries for example. In what follows, we present the 
profile and prevalence of SSD in order to shed light on their main characteristics. 


1.1 Profile and classification of SSD 
Examples to illustrate some speech sound errors are given in Table 1. 


Table 1: Speech sound errors 


French word Illustration Error types 

viande [vjad] — [bad] substitution 

couscous [kuskus] — [tutu] substitution; omission 
chaise [fez] — [fe] omission 

table [tabl] — [tab] omission 

chocolat [fokola] — [kokola] consonant harmony 


The generic terminology for these disorders varies in the literature, but in this 
study, we use the term “speech sound disorder” (SSD) to include articulation dis- 
orders, phonological delays and phonological disorders. Articulation disorders 
are phonetic in nature. For example, a child can systematically replace a sound 
with another regardless of the syllabic context (see examples 1 and 2 above) while 
another may systematically omit it (as in example 2). These types of systematic 
errors indicate a phonetic difficulty to perform the movement required to pro- 
duce a sound (Fox & Dodd 2001). Another child may exhibit phonological er- 
rors that are specific to younger children. Here, a child may assimilate or omit a 
sound in a specific context and correctly produce it in another (as in examples 3, 
4 and 5 above). The persistence of these types of errors above the expected age 
results in phonological delay. Thus a delay of at least six months is significant 
for this purpose (Dodd 2013). However, when phonological errors are different 
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from phonological processes (normal errors in phonological development), they 
are considered as atypical phonological errors and can be consistent or inconsis- 
tent depending on whether the same error pops up often or occasionally (Dodd 
2013). Finally, a child may exhibit a systematic error on a sound and assimilate 
another sound, thus showing an overlap between articulation and phonologi- 
cal problems. Therefore, we use the term SSD to include both articulation and 
phonological problems and the likely overlap between them. 


1.2 Prevalence of SSD 


There is a lack of consensus on the prevalence rate of SSD from one study to 
another, ranging between 23% and 2% from our literature review McKinnon et 
al. (2007); Shriberg et al. (1999); Fombonne & Vermeersh (1997); Beitchman et al. 
(1986); Enderby & Philipp (1986); Kirkpatrick & Ward (1984); Silva et al. (1984); 
Silva (1980); Peckham (1973); Stevenson & Richman (1976); Morley (1972); Hull et 
al. (1971), Chevrie,. This inconsistency could be explained by the diversity of so- 
cial factors (different countries: USA, Australia, Canada, France, Great Britain) as 
well as differences in methodological approaches (cross-sectional or longitudinal 
study, random or non-random sampling, one or several age groups). However, 
there seems to be a decrease in the rate of prevalence with age, these speech 
disorders being more frequent in younger preschool-aged children compared to 
school-aged ones (Morley 1972). This rate decrease can be explained by the fact 
that preschool children have not yet acquired the complete phonology of their 
language and tend to master the use of speech sounds between the ages of six 
and seven years for languages such as French (e.g. Rvachew et al. 2013). Simi- 
larly, since some speech disorders are phonological delays, they may evolve and 
some children can catch up without speech intervention and perform well in for- 
mal tests (e.g.Bishop & Edmundson 1987). However, it is worth noting that this 
apparent catch up may be illusory as some children often have residual phono- 
logical processing impairments (e.g. Stothard et al. 1998). Furthermore, all studies 
have higher prevalence rates for boys than girls (e.g. Shriberg et al. 1999). Finally, 
regarding the impact of socioeconomic status (SES) on the prevalence of articula- 
tory disorders, McKinnon et al. (2007) conducted a study with 10425 students in 
Australia and found no difference between SES groups. In conclusion, the main 
variables to be studied for the prevalence of SSD are the status of the spoken 
language, age and gender. 
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2 Context 


2.1 Sociolinguistic description 


Like many sub-Saharan countries, Cameroon is a multilingual country where 
children are exposed to at least two languages in their environment. Generally, 
bilinguals are categorized according to several factors but the main ones in this 
study are the age and order of exposure to both languages and the social sta- 
tus of the two languages. Therefore, we limited our attention to early bilingual 
Cameroonian children who speak Ghomálá' and French and who live in one of 
the following sociolinguistic contexts: either a dominant French milieu (urban 
area) or a dominant Ghomálá' milieu (rural area). However, whatever the region, 
French is the main language used at school in Cameroon and children are en- 
rolled in the nursery school system from age three. This diglossic categoriza- 
tion of languages is a general feature of the sociolinguistic context in Cameroon, 
where the Administrative Atlas of National Languages (Breton & Fohtung 1991) 
lists 248 languages. In addition to this diversity, French and English are the two 
official languages. Based on the national distribution of official languages there 
are officially two linguistic communities: the French-speaking community (cov- 
ering eight of the 10 regions) and the English-speaking community (covering the 
two remaining regions). Only the French-speaking community was covered in 
this study. 

Generally, the social status of French varies between urban and rural areas. 
Because urban areas are multilingual, French is the dominant language and it 
is acquired from an early age. This is the case in Bafoussam, the urban area in- 
vestigated, where children acquire French as first language with the presence of 
Ghomala’, the major local language spoken, and other minority languages. Com- 
pared to urban areas, rural areas provide a more homogeneous sociolinguistic 
environment where children are exposed both to the local language and French, 
and their contact with French intensifies with start of kindergarten at age three. 
Here, the local language is the main instrument of communication especially in 
the family environment and French is primarily used outside the home, especially 
by young people. Therefore in rural areas, the use ofthe local language is socially 
dominant although French is dominant at school and for public services. This is 
the case of Bandjoun, the rural area investigated in this study, where Ghomálá' 
and French are spoken and where the child's contact with French intensifies with 
entry into kindergarten. 
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2.2 Ghomala’ and French 


Ghomálá' and French are the main languages spoken in the Mifi, Kounghi-Khi 
and Hauts Plateaux administrative divisions in the western region of Cameroon 
(Breton & Fohtung 1991). Compared to French, Ghomálá' is a tone language with 
three simple and two complex tones (see Nissim (1981) for a detailed description 
of Ghomálá' phonology). Classified as a Bamiléké language (one of the Bantu 
languages) only spoken in Cameroon (Dieu & Renaud 1983), Ghomálá' is com- 
prised of 18 dialectal varieties with mutual comprehension. However, the variety 
of Bandjoun (Ghomálá' jo) is used as the standard reference (Domche & Hatfield 
1991). Thus, this study covers speakers of this variety. Regarding French, children 
are exposed both to standard French (FS) mainly at school and through television 
and radio, and to the variety of Cameroonian French (FC) spoken in the western 
region of Cameroon. Besides, the French pronunciation to which the child is ex- 
posed varies according to the education level of the speakers. The elite with a 
high education level makes greater use of FS, while uneducated or less educated 
parents speak more in the local FC (Biloa 2004). This means that children are 
exposed to both varieties of French. 

Ghomálá' and French share many consonants. The consonant system of FC is 
based on FS phonology, comprising stops and fricatives, labial /p, b, m, f, v/, apical 
/t, d, n, s, z, l/, palatal /f, 3, n/, velar /k, g/, uvular /5/ and glides /w, y, j/. The major- 
ity of these consonants are oral against only three nasal /m, n, p/. However, due 
to language contact, the FC may also include the nasal velar /n/, the glottal frica- 
tive /h/ and the glottal stop /?/ (see Biloa (2004) for a detailed description of FC). 
In addition to these phonemes, Ghomálá' also has the affricated consonants /pf, 
bv, ts, dz, tf, dz/ and the velar fricative /v/ (Mba & Domche 1995). However the uvu- 
lar /s/ is underrepresented in Ghomálá' (Mba & Domche 1995). For this reason, 
and as reported by Biloa (2004), some Ghomálá' speakers of French may exhibit 
one or more of the following interference signs: in the coda position, /s/ may be 
omitted (resulting in vowel lengthening or shortening), replaced by a glottal or 
a velar stop (e.g. [axza] ‘money’ pronounced [a:a], [agza] or [a?za]), or simply 
replaced by the apical [r]; in syllable initial, /&/ may be replaced by the apical [r] 
or the lateral [l]; finally in the consonant group, /g/ may be omitted. Adding to 
this variation in the use of the uvular /s/, the phonological interference between 
French and Ghomálá' also appears through a tendency to articulate the mute e 
in syllable final (e.g. [tabla] instead of [tabl] ‘table’). There may be other interfer- 
ence processes but the ones listed here are the most frequent characteristics of 
the local “accent”. 
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Finally, the distribution of consonants in the syllable structure shows some 
similarities and differences between French and Ghomala’. In Ghomálá' all the 
consonants can appear in initial position except for the glottal stop /?/, but only 
the stops /p, m, k, n, ?/ can appear in final position. However in French, all the 
consonants, including the palatal glide /j/, appear in both positions but the labial 
glides /u/ and /w/ don't appear in final. Concerning the structure of consonant 
groups, in Ghomálá' they only appear in initial position and can have from two 
(CC) to four (CCCC) consonants. CC structure can be nasal + stop (e.g. /gkáp/ 
‘money’) or stops + /h/ (e.g. /pho/ bag, and CCC and CCCC structures are made 
of (nasal+) nasal+stop+glide (e.g. /nkwa/ ‘foot’; /mgkwo/ ‘feet’), nasal+stop+ /h/ 
(e.g. /mpho/ ‘bags’) or nasal+nasal+stop (e.g. /mntáp/ ‘shoes’) where the first 
nasal may be the plural morphem /m-/. These structures are completely different 
from French where consonant groups may appear in initial and final syllable po- 
sitions, and be composed of two (CC) and even three (CCC) consonants. French 
CC are formed of fricative+stop (e.g. /staz/ ‘traineeship’), stop or fricative + liq- 
uid or glide (e.g. /te&/ ‘train’; /bwa/ ‘wood’; /tabl/ ‘table’), while CCC structure 
may be composed of liquid +stop+liquid (e.g. /asbs/ ‘tree’) or stop+liquid+glide 
(e.g. /plui/ ‘rain’). 


3 The study 


3.1 Objectives 


Using a clinical linguistic approach, the general objective of this study is to ana- 
lyze SSD among a population of Cameroonian bilingual children. Specifically, the 
study aims to assess the prevalence of these disorders in preschool- and school- 
aged children and to describe their profile based on these age groups; that is, 
from 4-8 years old, in order to develop an intervention strategy. Besides, we 
also considered the fact that, generally, four-year-old children already have a 
good knowledge of the consonant system of their language (e.g., MacLeod et al. 
2011). However, as Cameroonian children are often exposed to two or more lan- 
guages, it is necessary to distinguish normal developing children from children 
with SSD. Based on the literature about the differences between bilingual and 
monolingual language acquisition, we have developed a procedure accordingly, 
as there is still no normative language data for the population under study. Even 
though the analysis of French consonant acquisition reveals that children liv- 
ing in a French dominant area master their consonant system around the age 
of six Rvachew et al. (2013), researchers often point out the difference in the de- 
velopmental trend of bilinguals compared to monolinguals, and the impact of 
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the interaction between languages in bilingual language acquisition Paradis et 
al. (2011). Therefore in this study, we limited the analysis to the consonants that 
are common to the languages spoken - French and Ghomala’ - in order to avoid 
all variations related to language dominance in the child's environment. We also 
considered that some errors may be due to that environment. 


3.2 Method 
3.3 Participants 


This study is based on a sample of 1127 bilingual French-Ghomálá' children aged 
4-8 years, who attended eight schools (five elementary schools and three kinder- 
gartens) with French as the only language of instruction (see 82.1 above for de- 
tails). At the time of the investigation these children had been attending school 
for at least one year. The constitution of this sample was determined by the 
schools that were chosen according to their location, size and accessibility. As 
it appears in Table 2, this sample includes 54% of children living in the rural ar- 
eas of Bandjoun, a Ghomálá -dominant locality, and 46% in the urban areas of 
Bafoussam, a French-dominant locality. There were 49% girls and 51% boys. The 
distribution of these children by age was as follows: four-year-olds = 10% (117 
children), five-year olds = 22% (254 children), six-year olds = 25% (279 children), 
seven-year olds = 20% (221 children) and eight-year olds = 23% (256 children). 


Table 2: Description of the sample based on school level, gender and 
socio-linguistic context (N = 1127 children) 


School level Gender Sociolinguistic 
context 


Maternelle? SIL? CP° girls boys rural urban Total 


N 136 518 473 553 574 606 521 1127 
7 12 46 42 49 51 54 46 100 
^(kindergarten) 
"(grade 1) 
"(grade 2) 
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3.4 Procedure 


To assess the children’s speech-sound performance, we followed the recommen- 
dations of speech-language therapists regarding the combination of informal and 
formal procedures so as to have a sample of connected speech, to elicit a set of 
single-word productions by picture- or object-naming and to assess sound pro- 
duction in a repetition test. We therefore used a procedure based on the one 
proposed by Maurin-Cherou (1993) for French-speaking children. In so doing, 
we were able to generally evaluate each child’s speech comprehension as well. 
Data collection took place over two school years, one school year per area, and 
was realized by the author of this paper who speaks Ghomálá' and French as first 
languages, in collaboration with a linguist specialized in applied linguistics who 
verified the API transcription, and an ORL specialist who performed the medical 
exam. Schools were also selected based on their location, size and geographic ac- 
cessibility. Subsequently, administrative formalities were completed in order to 
obtain the necessary authorizations to have access to the selected schools (which 
report to the education department) on the one hand and the classrooms (which 
are the responsibility of school principals) on the other hand. There were four 
steps in the data collection process: the teachers' training workshop, the prelim- 
inary screening, the language and speech assessment, and the ORL exam. 


Teachers' training workshop: After classifying the children by age groups, their 
classes and teachers were then identified. Then an information session 
and a training workshop were organized with the teachers of the selected 
classes about the preliminary identification of children at risk of devel- 
oping speech-language disorders. For this preliminary screening, we ex- 
plained the following criteria to the teachers: 


1. The child who speaks poorly (errors in the pronunciation of sounds, 
syllables, words and sentence) 


. The child whose language expression is difficult to understand 


. The child who has difficulty hearing and/or understanding 


2 

3 

4. The child who understands after several repetitions 

5. the child who is agitated or violent in class or out of class 
6 


. the child who is taciturn and doesn't speak in class or out of class 


We were assisted by a speech-language therapist for the preparation of this 
workshop. 
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Preliminary screening: After the training, teachers were given three months to 
draw up a list of children who met at least one of these criteria. The chil- 
dren identified were then assessed on their language use and speech pro- 
duction. This procedure had the merit of having the screening done on the 
basis of what is accepted as normal to the local population. A total of 100 
children were listed by the teachers. 


Oral language assessment: This assessment was conducted in an informal set- 
ting while establishing contact with the child before the formal assessment 
of speech sound production. First, the child was asked to talk about his/her 
usual activities and/or to tell a short story. The connected speech sam- 
pling was done in Ghomala’ and French (see the Appendix for the list of 
some Ghomala’ words used by the children). We then conducted a short 
vocabulary comprehension test following Maurin-Cherou’s protocol using 
some items randomly chosen from the picture-naming test. The child had 
to show the image of the spoken word among four choices. All the children 
passed this test. This assessment took about 15 minutes and the connected 
speech was recorded using an audio tape recorder. 


Speech sound assessment: Each child was evaluated during one session using 
two speech tests administered by the author of this paper: a picture- 
naming test and a word repetition test all in French. The child was first 
asked to name each picture of the naming test. Then s/he was asked to 
repeat a list of words chosen based on the sounds and syllabic structures 
that were difficult in the first test. The assessment lasted between 40 and 
45 minutes and it was also recorded. 


* The picture-naming test consisted of producing 64 words prompted 
by colour pictures depicting well-known objects chosen from the lo- 
cal environment (see the Appendix for the list of the 64 words). For 
some words, we used the object itself (e.g., for the word "grain" we 
used a sample of mixed grains of rice, corn and beans). The test as- 
sessed consonants common to Ghomálá' and French but French was 
mainly used as the language of assessment given its dominant status 
at school. However, an eight-year-old girl and a four-year-old boy 
said a few words in Ghomala’, in which case s/he had to repeat after 
the examiner. Each consonant appeared in different syllable positions 
except for the fricative [v]. The glides [w] and [j] mainly appeared in 
the consonant group. Other structures evaluated consisted of conso- 
nant+liquid, and the syllabic structures V, VC, CV, CVC. The children 
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were able to name all the images as they reflected realities of their 
daily lives. 


* The repetition test assessed the impaired consonants in different po- 
sitions. The child had to repeat the word clearly articulated by the 
examiner. 


Oto-Rhino-Laryngology (or ORL) exam: Only children identified as having SSD 
benefited from an ORL exam. The exams were conducted by a specialist in 
each of the target schools to evaluate the anatomical and physiological au- 
ditory system as well as the oral and respiratory system. Given that none of 
the screened children had organic disorders that could affect speech sound 
production (e.g. Hearing loss), in this study we present only the results of 
the speech sound assessment. 


3.5 Data analysis 


The identification of SSD was based on the analysis of consonants and syllabic 
structures from the connected speech sampled, the naming picture test and the 
repetition test. Each child's production was collected using an individual form 
and recorded using an audio recorder. At the end of each day, for each child, 
data collected by the author of this paper were transcribed using the IPA and 
verified by an applied linguist. Speech disorders were identified by analyzing the 
child's pronunciation based on expectations given his/her age. This was based on 
the consonant acquisition of 127 four-to-five-year-old French-speaking Cameroo- 
nian children living in a multilingual environment (Takam Unpublished). Gener- 
ally, the results of this preliminary study showed no significant difference be- 
tween the age groups (four-year-olds and five-year-olds): at least 90% of the chil- 
dren were able to accurately produce the consonants [p, b, m, n, p, j, f, s, v, k, d, 
3, E, W, Z], around 80% the fricative [f] and only 60% the lateral [1]. In the current 
study, errors were analyzed to determine the phonological processes classified 
as substitution, assimilation, omission, addition or metathesis as presented in the 
theoretical framework summarized in Table 3. 


3.6 Result 


3.6.1 Prevalence rate 


Following the preliminary screening, 100 children were listed as presenting at 
least one of the criteria. At the end of the assessment, 32 children were excluded 
from this sample because they did not present any of the speech errors above (see 
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Table 3: Classification of speech errors by structure (sound and sylla- 


le) 
Structure Error Phonological process leading to the... 
Sound Substitution replacement of a difficult sound by another 
causing a change in point or mode of articulation 
Distortion rough articulation producing a false noise 


Assimilation replacement of a sound which becomes like a 
nearby sound (sounds harmony) 


Metathesis ` position change of one or more sounds in the 
word 
Syllable ^ Omission erasing of a sound 
Addition vocalic or consonantal addition 
Metathesis ^ position change of one or more sounds in the 
word 


Table 3). Finally, 68 children presented with SSD in the form of phonetic disorders, 
phonological delays or phonological disorders, which represents a prevalence 
rate of about 6% of the 1127 children in the population studied. Tables 4 and 5 
present this prevalence by gender, sociolinguistic context, school level and age 
group. There was a significant difference in prevalence rates by gender with 4.5% 
of girls compared to 7.5% of boys [Y? = (1, N = 1127) = 4338, p = 0.03] and a 
ratio of 3.2. The prevalence rates by school level were also significantly different 
between kindergarten (8.8%) and grade 1 pupils (7.5%) on the one hand, and grade 
2 pupils (3.6%) on the other [y” = (2, N = 1127) = 8.88, p = 0.01]. However, the 
percentage by age [y? = (4, N = 1127) = 1.04, p = 0.90] and by sociolinguistic 
context [y? = (1, N = 1127) = 0.37, p = 0.54] were not significantly different. 


3.6.2 Speech sound profile 


On a segmental level, the analysis by age group showed some differences in the 
percentage accuracy of individual consonants as presented in Table 6. There was 
a considerable decrease in the number of highly impaired consonants with age, 
but less variation in the consonants mode of articulation as they appear to be al- 
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Table 4: Prevalence rate by gender, sociolinguistic context and school 
level. N = general population; n= children with speech disorders; 
Mat = kindergarten (4-5 years); SIL = grade 1 (5-6 years); CP = grade 
2 (6-8 years) 


Gender Sociolinguistic School level Total 
context 


Girls Boys Rural Urban Mat SIL CP 
553 574 606 521 136 518 473 1127 


n 25 43 39 29 12 39 17 68 
% 4.5 7.5 6.4 5.6 8.8 7.5 3.6 6.0 
Table 5: Prevalence rate by age groups 

4years 5years (6 years 7years 8 years 
117 254 279 221 256 
n 8 16 19 11 14 


% 6.8 6.3 6.8 5.0 5.4 


most fricatives. Generally, the sounds [l, x] were impaired in all age groups while, 
on the contrary, the nasals [m, n] and the voiceless bilabial [p] were well used. 
Three categories of impaired consonants emerged following the classification of 
Shriberg & Kwiatkowski (1994)!. Consonants [s, f, z, 1, x] were the most impaired 
with more than 30% of the children concerned, followed by the velars [k, g] with 
10-30% of children, and finally the sounds [v, n, f, j, w, y, b, t, d, 3] which were 
impaired for less than 10% of the children. 

As far as syllable structures were concerned (see Table 7), the CVC structure 
was the most impaired in all age groups with a frequency of about 97%. Interest- 
ingly, all eight-year-olds altered this structure against 87% of four-year-olds. On 
the other hand, the use of the consonant groups was much more laborious for 
four-to-five-year-olds. 


'Shriberg & Kwiatkowski (1994) distinguished three categories of consonants: the early-8 [m, b, 
j, n, w, d, p, h] with 100-80 percentage of correct production, the middle-8 [t, , k, g, f, v, tf, dz] 
with 70-30%, and the late-8 [f, 0, s, z, 6, 1, r, 3], with less than 20% frequency of correct use. 
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Table 6: Impaired consonants by age group. Not impaired: 0% of the 
children; Low frequency: consonants impaired for less than 10% of the 
children; Medium frequency: for 10-30%; High frequency: for more 


than 30% 
Age groups Not impaired Medium Highly 
4 years p. m, n, p, y j, v,d, w, f, t, 3, g, s, Zz, f,l, 5 
(n = 8) b, k 
5 years p, m, n, v, j An d, w, f,t ų, g, k, b, s z, f, LE 
(n= 16) 
6 years p, m, n, 3, v,b,p,d w, f, v,t y, g, k, f l, s, Z, § 
(n = 19) 
7 years p, m, n, p, v, f, b y, k, 3 j, w, t, g, d, s, 
(n= 11) Z 
8 years p, m, n, W, 3, Y v, b,n, f,j,t, gk s, f, z, LE 
(n = 14) d 
4-8 years p,m,n pv£uzaj gk s, f, z, LE 
(n = 68) w, b, t, d 
Table 7: Impaired syllable structures by age group 
4 years 5years 6years 7years 8years 4-8 years 
n 8 16 19 11 14 68 
CV 6.2 31.2 36.8 27.3 28.6 32.3 
VC 75.0 37.5 57.9 27.3 35.7 52.9 
CC 75.0 68.7 31.6 36.4 35.7 52.9 
CVC 87.5 93.7 94.7 90.9 100 97.0 
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Finally, omission and substitution were the most prevalent error forms as 
shown in Table 8. The Chi-square analysis of the percentage of children by age 
groups for each error reveals no significant difference. However, for distortion er- 
rors, the z-test with adjustment of the values according to the Bonferroni method 
indicated significant higher prevalence rates for four-year-olds (88%) and six- 
year-olds (80%) compared to the three other age groups: five-year-olds (44%); 
seven-year-olds (64%) and eight-year-olds (50%). (1) to (5) below show detailed 
evaluations of these types of errors. 


Table 8: Frequency of speech sound errors by age (N=68 children, 4-8 
years) 


Errors n %  Y'(A4N-68) p 

Omission 68 100 5.26 0.26 
Substitution 61 89.7 4.36 0.36 
Distortion 43 63.2 7.12 0.10 
Addition 32 44.1 5.69 0.22 


Assimilation 22 32.4 4.28 0.37 


Beginning with forms of omission, all the children revealed at least one of the 
following: syllable initial and final consonant omission, omission of an entire 
syllable, simplification of consonant groups and complex syllabic structures (see 
(1) below for examples). Final consonant omission was the most frequent form 
regardless of age (approximately 93% of the children), followed by the simplifica- 
tion of consonant groups (about 65% of the children) and the omission of syllable 
initial consonants (about 43% of the children). 


(1) Omission errors 


a. syllable final consonant omission 
(ardoise) [ag-dwaz] > [a-dwaz] — VC > V 
(couscous) [kuskus] > [ku-kus] — CVC > CV 

b. syllable initial consonant omission 
(parapluie) [pa-sa-plui] > [pa:-plyi] — CV>V: 
(arachide) [a-wa-fid] > [a:-fid] — CV>V: 

c. simplification of consonants group 
(crayon) [kse-j5] > [ki-j5] — CCV > CV 
(table) [tabl] > [tab] — CVCC > CVC 
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d. 


e. 


syllable omission 

(nourriture) [nu-si-tys] > [nu:-ty:] — CV 

(télévision) [te-le-vi-zj5] » [te-le-sj5] — CV 
simplification of complex syllable structures 

(arbre) [a:6b5] > [a:b] — VCCC > VC 

(parapluie) [pa-sa-plui] > [pa-ha-ply] — CCCV > CCV 


Substitution errors appeared as a systematic or an inconsistent replacement 


of a consonant leading to a variety of errors such as fronting, backing, stopping, 
gliding and devoicing (see (2) below). In general, the substitution of [f, 3] to [s, z] 
was the most frequent with a 68% prevalence rate, followed by backing, gliding 
and fronting errors with a 40-60% prevalence rate. Apart from devoicing (15% of 
the children) the other forms listed had prevalence rates less than 10%. 


(2) Substitution errors 


a. 


fronting 

(couscous) [kuskus] » [tutu] — [k] » [t] 
(chat) [fa] > [sa] — [f] > [s] 
substitution of [1] to [j] 

(crayon) [kgej5] > [keel5] — [j] > [I] 
(bouteille) [butej] > [butel] — [j] > [I] 
backing: 

(tasse) [tas] » [kas] t] » [k] 
(train) [twé] > [kkë] t] > [k] 
substitution of [f,3] to [s,z]: 
(tasse) [tas] > [taf] — [s] > [f] 
(maison) [mezo] > [mezo] — [z] > [3] 
stopping: 

(savon) [sav5] > [tav5]] 
(carte) [kart] > [kakta] 
substitution of [1] to [x] 
(robe) [sob] > [lob] — [x] > [I] 
(orange) [oxàz] > [olaz] — [x] > [l] 


x 
>I 


[s] > [t] 
[z] > [k] 


— 
— 


gliding: 

(fleur) [flex] > [fwox] — [l] > [w] 

(ballon) [bal5] > [baj3] — [1] > [j] 
substitution of vowel to glide 

(parapluie) [pasaplui] > [palaply] — [u]» [y] 
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i. Devoicing: 


(doigt) [dwa] >[twa] — [d]> [t] 


Regarding the errors classified as distortions, about 63% of the children glot- 
talized /&/ and only 12% of them also distorted the apical /t/ and the velar /k/ 
in syllable initial. It is worth recalling that in this study, distortion errors were 
neither a substitution nor an assimilation but an approximate production of a 
segment. Based on Table 3 above, the Chi-square analysis revealed no signifi- 
cant difference by age group [x? = (4. N = 68) = 7.12, p = 0.10]. However, the 
z-test with adjustment of the values according to the Bonferroni method indi- 
cated higher prevalence rates for four-year-olds (88%) and six-year-olds (80%) 
compared to the three other age groups: five-year-olds (44%); seven-year-olds 
(64%) and eight-year-olds (50%). (3) below shows examples of distortion errors 
observed. 


(3) Forms of distortion errors 

a. Distortion of [t]: 
(pantalon) [pätal5 ] > [pat^al5] — [t] > [th] 
(voiture) [vwatys] > [vwathy] — [t] > [t^] 

b. Distortion of [k]: 
(car) [kax] > [kła] — [k] > [k^] 
(crayon) [kxej5] > [k^ej5] — [k] > [kh] 

c. Distortion of [x]: 
(radio) [sadjo] > [hadjo] — [x] > [h] 
(robe) [sob] > [hob] — [x] > [h] 


Sound addition errors were present in about 44% of the children. One form was 
the simplification of a complex syllable structure by adding an epenthetic vowel. 
It concerned about 15% to 37% of children between the ages of 6-8 years. The 
other form, the most prevalent, was the adding of a consonant at the beginning 
of words with vocalic attack. It affected all age groups. (4) below illustrates these 
forms of errors. 


(4) Addition errors 
a. Vocalic addition (epenthesis) 
(fleur) [flex] > [fo-1e:] — CCVC > CV+CV 
(brosse) [bros] > [ba-los] — CCVC > CV«CVC 
b. Consonant addition 
(ardoise) [ax-dwaz] > [nag-dwaz] — VC > CVC 
(hache) [af] > [af] — VC > CVC 
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Finally, assimilation errors appeared as regressive or progressive consonant 
harmony and consonant devoicing as shown in (5) below. About 32% of children 
presented one or more of these errors without any significant difference between 
groups. Regressive assimilation of /f/ and devoicing errors were the main forms 
observed. 


(5) Assimilation errors 


a. Progressive assimilation 
(pantalon) [patal5 ] > [papal5 ] — [t] > [p] 
b. Regressive assimilation 
(chemise) [famiz] > [semiz] — [f] > [s] 
(chapeau) [Japo] > [papo] — [I] > [p] 
(fourchette) [fuxfet] > [fuxset] — [f] > [s] 
(chocolat) [fokola] > [kokola] — [f] > [k] 
(arachide) [agafid] > [aratid] — [f] > [t] 
c. Devoicing 
(gâteau) [gato] > [kato] — [g] > [k] 
(table) [tabl] » [tap] — [b] » [p] 


3.6.3 Discussion 


This study was aimed at assessing SSD from a representative quasi-random sam- 
ple of 1127 bilingual Cameroonian children aged four to eight years and living 
in two Ghomala’-French environments (rural and urban). The prevalence rate 
of 6% obtained in this study is comparable to the rate of 7.8% obtained by Fom- 
bonne & Vermeersh (1997) among French children between four and 16 years 
of age. In general, Kirkpatrick & Ward (1984) place this prevalence between 4% 
and 6%. These studies indicated a higher prevalence for boys compared to girls 
as was also the case in the present study. This difference is not yet explained 
in the literature. However, this discrepancy is also observed in regular language 
development as it is well admitted that girls generally develop speech and lan- 
guage faster than boys. Yet, this speed difference is significant, up to 30 months 
(Eriksson et al. 2012). 

Regarding the differences related to school level, it is important to note that 
most children with SSD were in kindergarten and grade 1 (SIL). Many of the first- 
grade children in this study had failed to progress to the second grade (CP). These 
results can be interpreted as evidence that SSD is a handicap to children's aca- 
demic performance. Several studies have linked SSD to children's education and 
it is now well recognized that they can give rise to learning disabilities such as 
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difficulty learning written language graphemes (e.g. Rvachew 2007). One expla- 
nation is that SSD is often related to a phonological awareness deficit which plays 
a significant role in written language learning. This causal relation is mainly de- 
termined by the severity of the disorders profile regarding types of errors (e.g. 
omission) and error pattern, whether they are typical or atypical among normal 
developing children Rvachew et al. (2007). Omission errors are considered more 
severe than substitution as are atypical errors compared to typical ones. 

Concerning the speech profile of the disorders in our sample, the main errors 
were omission and substitution without a significant difference between groups. 
Omission errors mainly occurred as final consonant omission and consonant 
group simplification, while the substitution of [f, 3] to [s, z] was the most fre- 
quent with about 68% of the children, followed by backing, gliding and fronting 
errors with a 40-60% prevalence rate. According to Rvachew et al. (2007) and 
Fox & Dodd (2001), all these forms of errors are also typical to normal devel- 
oping children. Consonants [s, f, z, 1, x] and CVC syllables were the most im- 
paired structures. These results are consistent with the literature both regarding 
type of errors (e.g. Austin & Shriberg 1997; Ruscello et al. 1991) and impaired 
consonants (e.g. Maurin-Cherou (1993) for French-speaking children; Shriberg 
& Kwiatkowski (1994) for English-speaking children). Besides, our analysis em- 
phasized the importance of the syllabic structure including the lexical position 
and environment of consonants. Very few studies (e.g. Rvachew et al. 2007) have 
examined this aspect while assessment and intervention always include them in 
order to better evaluate speech difficulties (e.g. Maurin-Cherou 1993). 

Regarding the impairment of [x] in this study, all the children had difficulties 
with its use. This consonant is among the most frequent in French (Genouvrier 
& Peytard 1970) while it is almost absent in Ghomálá' (Mba & Domche 1995). 
Some of the error forms reported here were the same as those observed in the 
sociolinguistic context of the children as described by Biloa (2004). However, the 
difficulties with this sound were not only due to language interference or input 
given that the errors were not always linked to the sociolinguistic context. In 
fact, several studies have reported difficulties with this consonant in different 
linguistic environments. In English (or native English) children, the apical [r] 
is usually substituted for the glide [w] in initial position (Bowen 2014). As for 
monolingual Francophones, this is a sound frequently affected by SSD Maurin- 
Cherou (1993). 
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3.7 Conclusion 


The objective of this study was to analyze child SSD in Cameroon in order to 
encourage the rehabilitation of language disorders in the public health system. 
Using a sample of 1127 bilingual children between the ages of four and eight years, 
the study determined the prevalence and speech profile of these disorders. In a 
nutshell, 6% of the children presented with SSD without a significant difference 
between age groups and sociolinguistic context (rural and urban). However, boys 
were more affected than girls with a ratio of 3.2. On the phonic level, consonants 
fe L s, z, f] were the most impaired (with 30% to 100% of the children screened) 
and the most frequent errors were omission and substitution. On the syllabic 
level, complex structures in general and the CVC structure, in particular, were the 
most impaired with a frequency of more than 90% followed by consonant groups 
with more than half of the children. The need for intervention no longer needs 
justification since these disorders have an impact both on individual and school 
performance. There are some limits to this study regarding the comorbidity of 
the speech disorders studied. None of the screened children had any difficulty 
understanding oral language. However, since we did not assess their language 
abilities (e.g., grammatical knowledge), it is possible that some of them may have 
a language delay. 


Appendix 

A sample of Ghomala’ words used (IPA) 

/pha/ ‘bag’ /bap/ ‘meat’ /sattha/ ‘comb’ 
/pa?/ ‘house’ /dap/ ‘thread’ /ywa?na/ ‘book’ 
/bje/ ‘groundnuts’ /nok/ ‘snake’ /dom/ ‘axe’ 
/mok/ ‘fire’ /sok/ ‘soap’ /bvu/ ‘dog’ 

/nam/ ‘sun’ fò? ‘hat’ /pusi/ ‘cat’ 

/wasi/ ‘watch’ /nkedé/ ‘banana’ /dza/ ‘clothe’ 
/sedja/ ‘broom’ /tsa/ ‘cola nut’ /wokta/ ‘umbrella’ 
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List of the 64 words (in French) used for picture naming and word 
repetition tests 


ananas casque grains pipe 
arachide chaise hache plantain 
arbre chapeau huile porte 
ardoise chat journal radio 
assiette chaussure livre robe 
avion chemise maison sac 
bague couscous marmite savon 
ballon crayon montre serpent 
banane doigt nourriture soleil 
blanc drap œuf table 
bonbon fenêtre oignon tasse 
bouteille feu oiseau téléphone 
brosse fleur orange télévision 
cahier fourchette pantalon train 
autocar gâteau parapluie voiture 
carte gomme peigne yaourt 
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Efik nominal tonal alternations as 
phrasal morphology 
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Certain Efik nominal constructions exhibit fixed tonal melodies that overwrite 
nouns’ underlying tones. Previous analyses of these alternations (Welmers 1973; 
Kim 1974; Cook 1985) are purely phonological. Working in a constraint-based 
framework, I propose that the tonal alternations are actually phrasal morphology 
(McPherson 2014). The tonal melodies are overlays encoded in lexicalized construc- 
tional schemas that relate idiosyncratic phrasal phonology with specific syntactic 
constructions. The constructional schemas are enforced by constraints. The Efik 
case extends the observed range of phrasal morphology by demonstrating that 
constructional schema constraints and phonological constraints can interact to de- 
termine a construction’s surface tones. 


1 Efik nominal tonal alternations 


Efik (Niger-Congo: Benue-Congo: Cross River) is a language spoken in Cross 
River State in southeastern Nigeria (Cook 1985). It has two tones, high (H) and 
low (L), which may combine in a single syllable to produce falling (HL) and rising 
(LH) tones (Welmers 1968). Additionally, there is a downstepped high tone (H) 
whose pitch is lower than H but higher than L. I analyze lH as an H after a 
floating L. 

In certain nominal constructions, including noun-noun compounds, adjective- 
noun constructions, and genitive constructions, nouns exhibit tonal alternations. 
For instance, in the noun-noun compound ‘dog house; the underlyingly H-H 
noun ébwá ‘dog’ is realized as H-L after the underlyingly H-L noun úfðk ‘house’: 
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(1) /ufsk ébwá/ — [üf5k ébwà] 
house dog 
‘dog house’ 


The complete patterns of nominal tonal alternations in compounds are given 
in Table 1. These patterns were reported by Welmers (1968) and Cook (1985) and 
confirmed with new data elicited from six native speakers of Efik. 


Table 1: Surface tones on the second noun in noun-noun compounds 


Underlying tonal shape of second noun 


Underlying tonal 


Group 1 Group 2 
shape of first noun 


HH H-HL HL LL LH L-HL^ H-H 


Alternation 1 H-H 
H-L 
L-L H-L HH 


Alternation 2 H-HL L-L L-H 


“L-HL nouns actually preserve their final fall, surfacing as H-/-HL and L-HL under Alternations 
1 and 2, respectively. 


As Table 1 shows, disyllabic nouns occur in seven tonal shapes: H-H, H-L, L-H, 
L-L, H-HL, L-HL, and H-lH (Welmers 1968: 86). Nouns longer than two syllables 
still exhibit one of these seven tonal melodies. In a noun-noun compound, only 
the tones of the second (non-head) noun alternate; the tones of the first noun sur- 
face unchanged. The seven nominal tonal shapes can be divided into two groups 
according to the pattern of alternations they exhibit. Group 1 comprises H-H, H- 
HL, H-L, and L-L nouns while Group 2 comprises L-H, L-HL, and HH nouns 
(the terms Group 1 and Group 2 come from Welmers 1968). Additionally, there 
are two alternation patterns in compounds, triggered by two different sets of 
nouns. Alternation 1 is triggered by first nouns with tonal shapes ending in H 
or L while Alternation 2 is triggered by first nouns with tonal shapes end in HL 
(Alternation 1and Alternation 2 are also Welmers’ terms). Each tonal shape group 
exhibits a different output under each of the two tonal alternations, yielding a 
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total of four patterns. After a noun ending in H or L, Group 1 nouns surface as 
H-L, and after a noun ending in HL, they surface as L-L. Group 2 nouns surface 
as H-!H after a noun ending in H or L and as L-H after a noun ending in HL. 
Thus in (1), the noun ébwá ‘dog’ surfaces as H-L in the compound ‘dog house’ 
because it is a Group 1 noun (with underlying tones H-H) and it occurs after the 
noun üf5k ‘house, which ends in L. 

While complex, these tonal alternation patterns can be summarized in a few 
generalizations. There are essentially two tonal melodies, HL, which corresponds 
to Alternation 1, and L, which corresponds to Alternation 2. The melody HL oc- 
curs after nouns ending in a level tone (H or L) while the melody L occurs after 
nouns ending in a falling tone (HL). The surface melodies H-!H and L-H exhibit 
the melodies HL and L, respectively, but instead of continuing to the end of the 
word, the melodies stop before an H that is the final tone of the word. H-!'H 
and L-H arise in Group 2 nouns, which differ from Group 1 nouns in that their 
tonal shapes contain an H after an L (in H-!H nouns, the L that the second H 
follows is unassociated, manifesting as downstep). Thus in Group 2 nouns the 
tonal melodies HL and L extend only as far as the H that followed the original 
underlying L and no further, leaving that H to be realized on the second syllable 
of the noun. 

In (2), I provide a few more compounds that illustrate the tonal alternation 
patterns in Table 1: 


(2 a. /übóm ijak/ — [übóm ijàk] Alt. 1 Group 1 H-L 

boat fish 
‘fish boat’ 

b. /5f5n usan/ — [5f5y ásàn] Alt. 1 Group 1 H-L 
cloth dish 
‘dish cloth’ 

c. /üsàn ejim/ — [üsàn éljim] Alt. 1 Group 2 HH 
dish onion 
‘onion dish’ 

d. /áfók iwág/ — [üf3k tán] Alt. 1 Group 2 H-!H 
house farm 
‘farm house’ 

e. /ikwa ébwá/ — [ikwâ èbwà] Alt. 2 Group 1 L-L 
knife dog 


> 


‘dog knife 
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f. /àwà ikwak/ — [awa ükwàk] Alt. 2 Group 1 L-L 
cat iron 
‘iron cat? 


g. /ikwa iinwén/ — [ikwâ inwén] Alt. 2 Group 2 L-H 


knife bird 
‘bird knife’ 
h. /awa i‘nwen/ — [awa inwen] Alt. 2 Group 2 L-H 
cat bird 
‘bird cat’ 


Previous analyses of the Efik nominal tonal alternations (Welmers 1973; Kim 
1974; Cook 1985) are purely phonological. Only Cook 1985 is fully elaborated. For 
noun-noun compounds, Cook posits a construction marker /H L/, that is, a high 
tone followed by a low tone with no segmental material, that occurs between the 
two nouns of the compound (e.g. /u-fok ° ` é-bwá/ ‘dog house’). He derives the 
surface tones of the second noun with three phonological rules. Two are tonal 
assimilation rules triggered by floating tones. Assimilation to Floating L changes 
each H of a continuous string of Hs to L after a floating L (e.g. 666/—[ 6 
6 6 ]) while Assimilation to Floating H changes a single L to H after a floating H 
and applies iteratively (eg./ o060/—[ c c c ] — [ oaa ]) (Cook 1985: 
193). These rules seem somewhat arbitrary. Why does a floating L affect a whole 
string of following Hs (Assimilation to Floating L) while a floating H affects only 
a single following L but can apply iteratively (Assimilation to Floating H)? Why 
do floating tones but not associated tones trigger these assimilations? 

These two assimilation rules have at least some application outside nominal 
constructions in Cook’s phonology of Efik, but the third rule that derives the 
tonal alternations in compounds is purely ad hoc. This rule, called L Copying, 
copies an initial L of a noun when that initial L is preceded by a word boundary 
and a floating tone. (Specifically, the L copies onto an open transition, a unit 
specific to Cook’s phonological analysis of Efik. An open transition is a segmental 
phoneme that is not audible but can bear a tone and occurs immediately after the 
initial vowel of most nouns.) L Copying’s sole role in the grammar is to ensure 
that tonal alternations in compounds and other nominal constructions come out 
right. In particular, it is needed to derive downstep where it is observed to occur. 

While Cook’s analysis can account for all four tonal alternation patterns in Ta- 
ble 1, the rules it relies on are unsatisfying. Moreover, it does not capture the gen- 
eralizations stated above, namely that there are essentially two surface melodies, 
HL and L, and that consistent tonal properties of the first and second nouns in 
compounds give rise to the full set of four patterns. 
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Instead of treating the nominal tonal alternations as purely phonological, I 
propose a phrasal morphology account for them. This analysis is presented in the 
following section. I then consider an alternative, constraint-based phonological 
account and argue that the phrasal morphology account is preferable. 


2 A phrasal morphology account 


The phrasal morphology account I advocate treats the surface tones in Efik com- 
pounds as tonal melodies that are imposed in particular constructions. This ap- 
proach is reminiscent of Harry & Hyman (2014) approach to Kalabari nouns, 
which exhibit tonal alternations in many of the same constructions as in Efik. 
While they do not provide a fully implemented analysis, Harry & Hyman ar- 
gue for a constructional, rather than a purely phonological, approach in which 
Kalabari nouns in certain constructions, including compounds and genitive con- 
structions, lose their underlying tones and are assigned a particular tonal melody. 

My analysis of Efik is couched in a different constructional framework, that of 
McPherson 2014. In McPherson’s framework, lexicalized constructional schemas 
relate idiosyncratic phrasal phonology with particular syntactic constructions. 
For example, in specific syntactic constructions, certain word classes may impose 
tonal overlays on other words. In the case of Efik noun-noun compounds, it will 
be the first (head) noun that imposes a tonal overlay on the second (non-head) 
noun. Recall the generalizations from §1: the second nouns in compounds exhibit 
two melodies, HL and L, with HL occurring after a first noun ending in a level 
tone and L occurring after a first noun ending in a falling tone. In keeping with 
McPherson’s framework, there is a constructional schema for Efik noun-noun 
compounds that specifies a tonal overlay with two allomorphs, {HL} and {L}, and 
the environment in which each allomorph occurs. The constructional schema 
is associated with a constraint that enforces the application of the appropriate 
tonal overlay, and the interaction of this constructional schema constraint with 
other phonological constraints gives rise to the full range of surface tones seen 
on the second nouns of compounds. 

The constructional schema for Efik compounds is given in Figure 1. Construc- 
tional schemas show the correspondence of idiosyncratic phonology, including 
tonal overlays, with specific syntactic structures. The schema in Figure 1 states 
that in a noun-noun compound, if the first noun ends in a falling tone HL, the 
tonal overlay {L} is imposed on the second noun, and if the first noun ends in 
a syllable with a single tone (H or L) associated to it, the tonal overlay {HL} is 
imposed on the second noun. 
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PHON < NP 
ie | 
Oj Oj Gi Qj N 
l| ME NE 
OC [] o [] Ni Nj 
"d d | | 
H L {i} T {HL} 


Figure 1: Constructional schema for Efik noun-noun compounds 


This schema is enforced with the single constructional schema constraint 
N NI. which is satisfied when the correct allomorph of the tonal overlay is 
imposed on the second noun of a compound. N NL counts one violation for 
each associated tone in the output that does not match the overlay (we will see 
later why its evaluation is not binary). Following McPherson (2014), the comple- 
mentary faithfulness constraint I use is FAITH(T). One violation of FAITH(T) is 
incurred when a word's tones in the output do not match its tones in the input. 

The tableau in Figure 2 shows the derivation of the surface tone pattern H-L 
(Group 1 Alternation 1) with the compound úfək ébwa ‘dog house’ A superscript 
on a word indicates that a tonal overlay has been imposed, whether partially or 


fully. 


/afsk ébwá/ |N NO) FAITH(T) 
| d 4 
HLH H 
= a. ufok ébwàHL * 
GR? 
HLH, L 
b. ufok ébwá *] 
| | | 
HL Hi EH 


Figure 2: Tableau for ufok ébwa 


The first noun in the compound, /ufok/, ends in L, so according to the con- 
structional schema in Figure 1 it seeks to impose the overlay {HL} on the second 
noun, /ébwá/. The constructional schema constraint N NUL is violated if the 
tonal overlay is not exhaustively imposed. Candidate (b), the faithful candidate, 
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only partially satisfies the overlay, realizing it on the first syllable of ebwa but not 
imposing it over the final H of this word. The H on the final syllable of ebwa does 
not match the tonal overlay, so one violation of N NODE is incurred. Candidate 
(a) imposes the overlay exhaustively and so does not violate the constructional 
schema constraint. It changes the tones of ebwa, violating FAITH(T), but since 
N NE >> FAITH(T), it is the winner. Note that I assume lexical tones can real- 
ize tonal overlays. Thus in the winner the first tone of the {HL} overlay is in fact 
the original first H of ébwa; this lexical tone is maintained since it can realize the 
overlay. The same is true in candidate (b). 

The tableau in Figure 3 shows the derivation of the surface tones L-L (Group 
1 Alternation 2) with the compound íkwá ébwa ‘dog knife’. 


/ikwà |. ébwá/|N NL | FArTH(T) 


HHL H H 
c. ikwâ ébwal W * 
^| | 
HHLL H 


Figure 3: Tableau for íkwá ébwa 


The first noun of the compound, /íkwá/, ends in HL, so according to the con- 
structional schema it seeks to impose the overlay {L} on /ébwá/. The faithful 
candidate, (b), does not impose the overlay at all, thereby incurring two viola- 
tions of N NL. and candidate (c) imposes the overlay only partially, incurring 
one violation of N NL. Consequently, they both lose to candidate (a), which 
exhaustively imposes the overlay. 

Having accounted for the surface tones of Group 1 nouns in compounds, I now 
consider Group 2 nouns. The surface melodies that must be derived are H-!H and 
L-H. As discussed in 81, Group 2 nouns (those with the tonal shapes L-H, L-HL, 
Or HH) differ from Group 1 nouns in that they contain an H after an L (the L is 
floating in Gë nouns). After nouns ending in a falling tone, Group 2 nouns have 
the surface tones L-H (Group 2 Alternation 2). These nouns do exhibit the melody 
of the {L} allomorph of the tonal overlay, but the noun’s original underlying H 
after the underlying L is preserved. That is, the imposition of the overlay is halted 
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by the H after the L and does not extend beyond it. I therefore propose that an H 
following an L in the same word is preserved by the following special positional 
faithfulness constraint: 


(3) PRESERVEHPOSTL: An associated H that follows an L (associated or not) 
within the same word in the input must be associated in the output. 


This constraint, combined with the constructional schema constraint, derives 
the surface pattern L-H. The idea is that PRESERVEHPosTL blocks the tonal over- 
lay from proceeding past the H after the L. The tableau in Figure 4 illustrates this 
with the compound íkwá inwen ‘bird knife’. 


likwâ  i'nwén/|PRESERVEHPOSTL | N N®! | FAITH(T) 
^| | 
H H L HiL2Hs 
= a. ikwâ inwént * 
| A | || 
HHLL LH; 
b. ikwâ i'nwén mer 
| A | || 
HHLHiL;Hs 
c. ikwa inwént +I 
| A | | 
HHLLLL 


Figure 4: Tableau for íkwá inwen 


Since the first noun of the compound, /ikw4/, ends in HL, the constructional 
schema dictates that it impose the overlay {L} on /i‘nwen/. PRESERVEHPOSTL 
outranks N NO and prevents exhaustive realization of the overlay, eliminating 
candidate (c). Because N N@DL counts the number of tones that do not match 
the overlay, though, non-exhaustive realization, as in candidate (a), is still better 
than no realization, as in (b), so the winner is [ikwâ inwenl]. 

The final surface melody to be derived is H-!H (Group 2 Alternation 1). Group 2 
nouns show the H-!H pattern after nouns ending in H or L, so according to the 
constructional schema it is the {HL} overlay that is being imposed. The surface 
tones H-!H do in fact show the {HL} overlay; the downstep is caused by the pres- 
ence of an unassociated L. The question is why the L is unassociated. The final 
H in the HSH pattern can be explained by PRESERVEHPOSTL, which prevents 
the overlay from reaching the end of the word. If the {HL} overlay were fully 
realized on a Group 2 noun while the H after the L was preserved as well (i.e. if 
the {HL} overlay were realized on the first syllable of the noun and the H were 
preserved on the second syllable), the result would be an HLH sequence within 
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a word. This sequence does not surface, so I claim that the constraint “HLH is 
active: 


(4) *HLH: Within a word, don’t have the tonal sequence HLH, where all 
three tones are associated.! 


“HLH has been proposed for a variety of languages (Cahill 2007; McPherson 
2016) and seems to reflect a desire to avoid multiple sharp changes in pitch in 
close proximity (cf. Hyman's (2007) Principle of Ups and Downs). There is inde- 
pendent evidence that *HLH is active within the word in Efik. As mentioned in 
81, there are seven surface tonal shapes for Efik disyllabic nouns. All two-tone 
combinations of H and L are attested, as are the shapes H-HL, L-HL, and H-lH, 
but the shapes H-LH and HL-H are not attested. Additionally, *HLH constrains 
the surface tones of reduplicated verb forms (Glewwe 2017). 

Because realizing the {HL} overlay and preserving a final H would violate 
“HLH, the L delinks, yielding the surface tones H-!H. The tableau in Figure 5 
shows the derivation of this pattern with the compound üsän él jím ‘onion dish’. 


/ùsàn éjím/ 
| | \ 
LL Lil 


*HLH 


N N (H)L 
FAITH(T) 


PRESERVEHPOSTL 


V 
P 
Ee 
Lo 
FA 
B 
œ 

u 
8 

B 

* 

* 


b. üsän éjim Poen 


c. san êjim™ | | #1] * | * 
AN i 
L LHL H2 ; 
d. üsàn éjim™ | *! | v 
| | 
LL HL 


Figure 5: Tableau for usan é jim 


As it ends in L, /üsàn/ seeks to impose the overlay {HL} on /éjim/. Candidate 
(d) fully realizes the overlay, but it violates PRESERVEHPosTL, so it is eliminated. 
Candidate (c) preserves the H following the L in /éjim/ and partially realizes the 


"The domain of *HLH must ultimately be more specific because surface HLH sequences are 
permitted within a word in, for instance, inflected verb forms like á-sáná ‘s/he is walking’ 
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overlay by imposing the melody HL on the first syllable of ejim, but the result 
violates “HLH. Candidates (a) and (b) respect PRESERVEHPOSTL and “HLH. The 
winning candidate, (a), delinks the L to avoid violating *HLH but still realizes the 
{HL} overlay better than the faithful candidate (b). The optimal form is thus the 
attested [üsàn éljim]. 

The phrasal morphology account I have presented is able to handle the com- 
plete set of tonal alternations seen in Efik compounds. A potential alternative 
to this account is a purely phonological constraint-based analysis. In the next 
section, I consider what such an analysis might look like. 


3 A constraint-based phonological account 


In working out a phonological account of the tonal alternations in Efik com- 
pounds, I adopt Cook’s (1985) construction marker /H L/, which occurs between 
the two nouns of a compound. The tones of the construction marker are com- 
pelled to surface by REALIZEMORPHEME (Gnanadesikan 1997; Kurisu 2001; van 
Oostendorp 2005). The interaction of REALIZEMORPHEME with other phonologi- 
cal constraints gives rise to the four different surface patterns. 

The tableau in Figure 6 exemplifies the derivation of the surface tones H-L 
(Group 1 Alternation 1) with the compound úfðk ébwa ‘dog house’. Candidates 
(c) and (d) do not realize both tones of the construction marker, so they are elim- 
inated by REALIZEMORPHEME. “HLH, introduced above in the phrasal morphol- 
ogy account, rules out associating both tones of the construction marker to the 
first syllable of ebwa, as in (b), so (a) is optimal. Note that REALIZEMORPHEME 
must be defined as being satisfied only if both tones of the construction marker 
are associated; otherwise, (c) would be as harmonic as (a). This is a departure 
from well-known formalizations of REALIZEMORPHEME that only require that 
some element of the morpheme’s input be realized on the surface to satisfy the 
constraint (Gnanadesikan 1997; van Oostendorp 2005). One might argue that in 
candidate (c) the L of the construction marker is realized insofar as it has a sur- 
face phonological effect, namely, downstep. This would be in the spirit of ap- 
proaches (e.g. Gnanadesikan 1997) in which any detectable phonological effect 
of the morpheme counts as realization. However, for candidate (a) to be chosen 
over candidate (c), REALIZEMORPHEME must require both tones of the construc- 
tion marker to be associated, not merely audible in some way. I return to this 
subject below. 

Next I turn to the surface tones HH (Group 2 Alternation 1). This is one of 
the patterns exhibited by Group 2 nouns, those with the tonal shapes L-H, L-HL, 
or H-!H. The surface pattern H-!H does show the HL melody of the construction 
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úk ébwá/| REALIZEMORPHEME  *HLH 
| | i 
HLHLH H 


= a. ufdk ébwa 


HLHLH H 
b. úfðk ébwa ae 

At 
HLHLH H 
c. uk é'bwá WM 
ZA 
HLHLH H 
d. úk ébwá ei 
| 
HLHLH H 


Figure 6: Tableau for oft ébwa 


marker, but the L is suppressed, appearing only as downstep, while the under- 
lying H after the underlying L is preserved. To derive the behavior of Group 2 
nouns, I make use of the constraint PRESERVEHPOSTL from the phrasal morphol- 
ogy account. 

The surface pattern H-H is exemplified by the compound usan él jim ‘onion 
dish.” Notice that if the H on the second syllable of /éjim/ is preserved on the 
surface, as in [é jim], it means that the L of the construction marker /H L/ has 
not been associated. That is, only one tone of the construction marker is associ- 
ated, meaning that this candidate violates REALIZEMORPHEME as defined above. 
In that case, one would expect the faithful candidate [üsàn éjim], in which none of 
the tones of the construction marker are associated, to win, since this candidate 
also violates REALIZEMORPHEME but is more faithful. Overcoming this problem 
requires a workaround. To ensure that the correct candidate wins, I allow RE- 
ALIZEMORPHEME to count units of the morpheme being compelled to realize, so 
that its evaluation is no longer binary. In this case, the units of the morpheme 
are tones. 

Now the surface tones H-!H can be derived, as shown in the tableau in Figure 7 
for usan él jim ‘onion dish’. Candidate (d) associates one tone of the construction 
marker to each syllable of ejim, but this violates PREsERvEHPosTL. Candidate 
(c) preserves the underlying H after the L by realizing the construction marker 
on the first syllable of ejim, but this violates “HLH. The faithful candidate (b) 
incurs two violations of REALIZEMORPHEME while (a) only incurs one by partially 
realizing the construction marker, so (a) wins. 

If the downstep in candidate (a) could qualify as realization of the floating L 
of the construction marker and therefore exempt candidate (a) from violating 
REALIZEMORPHEME, candidate (b) would no longer need to incur two violations 
of the constraint to lose to candidate (a). The evaluation of REALIZEMORPHEME 
could then remain binary. We saw in the derivation of üf5k ébwa (Figure 6) that 
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T zB di | : 
a : ` 
LLHLIH |& iE 
uid s 
ZF a 
ag 
mi i 
^i iE 
= a. üsàn é'jím D 
+ | 
LLHLLH 
b. üsàn èjim nel 
| | 
LLHLLH 
c. üsàn éjim *! 
"bl 
LLHLLH 
d. üsàn éjim | *! | 
CH 
LLHLLH 


Figure 7: Tableau for ùsàn él jim 


downstep cannot count as realization of the L of the construction marker, though, 
so this attempt to keep REALIZEMORPHEME binary will not work. 

I turn now to the surface tones L-L (Group 1 Alternation 2). This pattern arises 
in the compound íkwá ébwa ‘dog knife’? Group 1 nouns like ébwá ‘dog’ surface 
as L-L instead of H-L when the preceding noun ends in a falling tone HL. For 
some reason, the H of the construction marker does not associate to the first 
syllable of the second noun when the first noun ends in HL. It seems, then, that 
the HL#H (falling # high) sequence is avoided, but this structure does not violate 
“HLH because it is not within a word. Some other markedness constraint must be 
devised to penalize this structure. I therefore put forth the following constraint: 


(5) “HLH<3: Don't have the tonal sequence HLH on fewer than three 
syllables.” 


The constraint specifies that the HLH sequence must occur on fewer than three 
syllables to incur a violation because in a compound like 4f5k ébwa ‘dog house; 
there is an HLH sequence spanning three syllables, and the form is perfectly licit 
(see McPherson 2016 for another *HLH constraint with a configuration restric- 
tion). 


This constraint must also be restricted to some domain, since HLH sequences on fewer than 3 
syllables can arise elsewhere, e.g. on a subject prefix and following verb in #-tjë-yé-tje 15G-sit- 
NEG-FOC ‘I’m not sitting: 
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Realizing the H of the construction marker on the first syllable of the second 
noun when the first noun ends in the contour tone HL would violate "HLH 3. so 
the H of the construction marker changes to L. The tableau in Figure 8 shows how 
the surface tones L-L are derived for the compound íkwá ébwa dog knife’. The 
faithful candidate (e) is eliminated because it violates *HLH<3; it also fails to real- 
ize the construction marker. Candidates (c) and (d) both realize the construction 
marker, but both still violate “HLH<3 (candidate (c) does so twice). Candidate 
(c) also violates *HLH due to the HLH sequence on ebwa. (Note that candidates 
(d) and (e) do not violate *HLH because “HLH, unlike “HLH<3, only applies to 
HLH sequences within a word.) Candidate (b) avoids violating *HLH«3 by asso- 
ciating the second tone of the construction marker to both syllables of ebwa, but 
in not associating the first tone of the construction marker it violates REALIZE- 
MorPHEME. Candidate (a) changes the first tone of the construction marker from 
H to L, violating IDENT(T), but this faithfulness constraint is lower-ranked, so (a) 
wins. (Candidates (a) and (b) are homophonous, so analyses other than the oneI 
have opted for here are possible.) 


likwa ébwá/ 
| A | | 
HHLHiL:H H 


*HLH<3 
*HLH 
REALIZEMORPHEME.| 
ID(T) 


= a. ikwâ ébwa ae * 

he 27 
HHLLiL;H H | j 
b. ikwa ébwa E mper 


HHLH;bHHS| | 
d. ikwá sbwa | *! | 
het 
HHLMiI2H H| | | 
e. ikwa ébwá | #1) Le 
^ | H H 
HHLH:iL;H H 


Figure 8: Tableau for ikwa ébwa 


The last pattern seen in compounds is when the second noun has the surface 
tones L-H (Group 2 Alternation 2). A compound that exhibits this pattern is ikwa 
inwén ‘bird knife’ In this phonological analysis, the surface tones L-H can be 
derived through a combination of the effects of PRESERVEHPOSTL and *HLH-3. 
PRESERVEHPosTL ensures that the underlying H that occurs after a (floating) L in 
Hinwén/ is preserved in the surface form [inwén], and *HLH<3 prevents inwen 
from having an H on its first syllable because after íkwâ this would result in an 
HLH sequence on fewer than three syllables. 
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4 Comparing the phrasal morphology and phonological 
accounts 


The phrasal morphology account I proposed in §2 and the constraint-based 
phonological analysis outlined in §3 are both able to derive the tonal alterna- 
tions seen in Efik compounds, but they differ in their principal mechanism for 
generating the surface tonal melodies. The phrasal morphology account stipu- 
lates two allomorphs of a tonal overlay that is imposed on the second noun of 
compounds: {L} after nouns ending in HL and {HL} after other nouns. The phono- 
logical account posits a construction marker /H L/ between the two nouns of a 
compound. The phonological account is appealing in not requiring the added 
apparatus of constructional schemas and the stipulation of two tonal overlays. 
On the other hand, the phrasal morphology account is appealing in capturing 
the output-orientedness of the patterns; the data give the impression that nouns 
should simply have a certain tonal shape in compounds. McPherson (2014) specu- 
lates that many cases of phrasal morphology are hidden in the literature because 
they have been analyzed phonologically. 

Both the phonological account and the phrasal morphology account are con- 
straint-based, and the constraint sets they use are quite similar. In particular, 
both analyses use PRESERVEHPosTL to capture the difference between Group 1 
and Group 2 nouns' surface tones in compounds. There are drawbacks to the 
constraints used in the phonological account, though. First, the phonological 
analysis requires the evaluation of REALIZEMORPHEME to be non-binary when it 
has most commonly been binary (Gnanadesikan 1997; Kurisu 2001; van Oosten- 
dorp 2005; Wolf 2007). That is not to say non-binary evaluation is unprecedented. 
Samek-Lodovici's (1992) original formulation of the morpheme realization con- 
straint, AFFIX REALIZATION, required each specification of an affix to be realized 
and counted one violation for each unrealized specification. Additionally, Wolf 
(2007) proposes a constraint MAXFLT that accomplishes the work of REALIZE- 
MORPHEME but also requires that all floating autosegments in the input have 
output correspondents. Trommer (2012), however, argues that a morphological 
reanalysis of the data Wolf uses to justify MAXFLT makes the more powerful 
MaxFrr unnecessary. Therefore the more restrictive, binary REALIZEMORPHEME 
should be retained. 

A full discussion of the proper formalization of morpheme realization con- 
straints is beyond the scope of this paper. For now, I point out that if REAL- 
IZEMORPHEME requires all elements of a morpheme to be realized and counts 
a violation for each unrealized element, as it does in the phonological account, 
this opens the door to further complications. For instance, when the construction 
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marker is realized by its tones being associated to the second noun of the com- 
pound, should the delinked tones of the second noun (e.g. the two delinked Hs 
of ebwa in candidate (a) in Figure 8) also incur violations of REALIZEMORPHEME? 
If not, why not, when those tones are units of the morpheme consisting of the 
second noun of the compound? 

It should be acknowledged that the phrasal morphology account also requires 
non-binary constraint evaluation, in this case of the constructional schema con- 
straint N MII This is a departure from McPherson (2014). That said, the non- 
binary evaluation of N N (HI is well defined: one violation is incurred per surface 
tone in the output that does not match the tonal overlay. The non-binary eval- 
uation of REALIZEMORPHEME, in contrast, raises further questions about how it 
should apply. 

Another drawback of the phonological account is that it requires the constraint 
"HLH-3 in addition to the constraint "*HLH. *HLH<3 seems to duplicate *HLH 
but must specify a domain of two syllables or less while also being allowed to 
apply across word boundaries. As discussed above, "HLH seems well-motivated 
for Efik, but there is no independent motivation for "HLH«3. Moreover, it would 
have to be restricted to compounds because HL#H sequences are permitted in 
other constructions involving two adjacent nouns, as in the genitive construction 
in (6) and the double object construction in (7): 


(6) Genitive construction 
awa éjin 
cat child 
‘the child's cat? 


(7) Double object construction 
rh- má Gun awa íják 
1SG-PAST.AUX 18G-give cat fish 


‘I gave the cat a fish’ 


As pointed out by a reviewer, having to restrict *HLH<3 to compounds would 
cease to be a problem if we considered a compound to be a single phonological 
word. In that case, we could simply say that *HLH<3 applies within words. The 
two nouns composing a compound would then be subject to *HLH«3 while the 
two nouns composing a genitive construction or a double object construction 
would not be. This solution will not work, however, because "*HLH«3 must also 
apply to adjective-noun constructions, which are not the focus of this paper but 
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which also exhibit nominal tonal alternations. The example in (8) illustrates how 
“HLH <3 is active in adjective-noun constructions: 


(8) /ékpri ébwá/ — [ékpri ébwà] 
little dog 
‘little dog’ 


The adjective ékpri ‘little’ ends in a falling tone, and like nouns ending in 
falling tone it triggers Alternation 2 on a following noun. Thus the underlyingly 
H-H noun ébwá surfaces with the tones L-L. If ebwa had retained its initial H after 
the final HL of ékpri, the phrase would violate *HLH<3. While compounds may 
constitute a single phonological word, it is unlikely that an adjective followed by 
a noun would. Consequently, the elegant solution of restricting "HLH-3 to the 
domain of the phonological word is unavailable. Instead, we would have to some- 
how define *“HLH<3 as applying to compounds and adjective-noun constructions 
but not to genitive constructions and double object constructions. 

Regarding *HLH<3’s duplication of *HLH, one might argue that *HLH<3 suf- 
fices to derive the tonal alternations in compounds and that *HLH can be gotten 
rid of. The losing candidates that violate “HLH in the tableaux in $3 all violate 
"HLH-«3 as well because the HLH sequence, even when it is within one member 
of the compound, never occurs on more than two syllables. However, it is not 
the case that a within-word HLH sequence is only illicit on fewer than three syl- 
lables in Efik. In reduplicated verb forms, HLH sequences in which each tone is 
associated to a different syllable are repaired (Glewwe 2017). For instance, a verb 
form that would otherwise be expected to surface with an HLH sequence on the 
final three syllables in fact surfaces with an HLL sequence, as exemplified in (9) 
(the underlying form of the verb root ‘bend down’ is /nüy$/): 


(9) i-kí-hnó-nüy5 *i-ki-nó-nüàyó 
1PL-NEG.PAST.AUX-LEX-bend.down 


‘we bent down 


Evidence from contrastive verbal reduplication shows that the more general 
“HLH is still needed in the grammar of Efik, regardless of whether it is used 
in the analysis of the tones of Efik compounds. Thus the grammar would still 
have to contain two "HLH constraints, the broader *HLH and the more restricted 
"HLH«3. 

To sum up, the phonological account relies on an unusual and potentially prob- 
lematic definition of REALIZEMORPHEME. It also requires an additional constraint 
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that the phrasal morphology account does not, namely, *HLH<3. This constraint 
seems ad hoc and is not well motivated for Efik. Broadly speaking, the phono- 
logical account resorts to dubious constraints to explain the full range of tonal 
alternations with just the construction marker /H L/ while the phrasal morphol- 
ogy account allows two allomorphs of the tonal overlay to capture what seem 
to be the two different surface melodies (HL and L) that arise in compounds. I 
therefore favor the phrasal morphology account over the phonological account. 

That said, many of the types of evidence McPherson (2014) uses to argue for 
the phrasal morphology account in Dogon languages, such as long-distance im- 
position of tonal overlays or competition between two words seeking to impose 
their overlays on an intervening word, are not available in Efik. Further investi- 
gation of Efik nominal tonal alternations may yield additional support for one ac- 
count over the other. In this paper, I have focused on Efik noun-noun compounds, 
but nouns in genitive constructions and adjective-noun phrases also exhibit the 
same surface melodies seen on the second noun in compounds, though they dif- 
fer in which surface melodies arise in which environments. Exploring how both 
the phrasal morphology account and the phonological account can be extended 
to the tonal alternations in these other Efik nominal constructions could shed 
light on their relative merits. Finally, it would also be worthwhile to examine 
surface tonal patterns in longer phrases containing multiple targets for the tonal 
alternations. 
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In this paper, we trace the development of Proto-Bantu noun classes into Teke 
(Bantu B71, Ewo dialect), showing that formal reflexes of classes 1, 2, 5-9, and 14 
are detectable. We further show that animacy, abstractness, and number allow us 
to determine the fate of classes 3, 4, 10, 11 and identify the following singular/plural 
genders: 1/2 (animate «PB 1/2, some 9/10), 1/8 (inanimate, «PB 3/4), 14/8 (abstract, 
«PB 14/8), 5/6 («PB 5/6), 5/9 («PB 11/10, with 10-9 merger), 7/8 («PB 7/8), and 9/6 
(«PB 9/6). Such reassignments provide a window into probing parallel noun class 
changes in other Northwest Bantu and Niger-Congo in general. 


1 Introduction 


In this paper we have two goals. First, we trace the development of the Proto- 
Bantu (PB) noun classes into a variety of Teke, a group of closely related, un- 
derstudied B70 languages spoken in Gabon, the Republic of the Congo, and 
the Democratic Republic of the Congo. Second, we discuss how the Teke facts 
provide a window into probing parallel noun class changes in other Northwest 
Bantu, Bantu, and Niger-Congo (NC) in general. In this sense we provide an addi- 
tional contribution and comparison with past work on the restructuring and loss 
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of NC noun classes. This includes, among others, the considerable work on merg- 
ers and loss in Bantoid (cf. the papers in Hyman 1980 and Hyman & Voorhoeve 
1980) and Cross-River (Williamson 1985; Faraclas 1986; Connell 1987; Hyman & 
Udoh 2006). Of particular interest will be the restructuring which takes place on 
the basis of animacy, something discussed at great length in Northeastern Bantu 
(Wald 1975; Contini-Morava 2008) and elsewhere in Bantu (Maho 1999: 122-126). 
We will show that both phonetic and semantic factors have played a role in the 
changes which have taken place between PB and Teke. All of the above - and 
more - is covered in very careful detail in Good (2012). 

Crucial to the approach taken here is that synchronic noun classes and genders 
(singular/plural pairings) are established by concord (agreement markers), not by 
affixal marking on the noun itself. On the other hand, as pointed out by several of 
the above studies, attention must be paid to both marking on the noun as well as 
on agreeing elements. Our attention is on the Ewo dialect of Teke B71 (Republic 
of the Congo), as spoken by the third author, reporting on a several month study 
together in Berkeley in Spring 2016. We begin by considering the situation in PB 
in 82, then turn to Teke in $3. The changes which have taken place between the 
two are enumerated in 84, followed by a presentation of our conclusions in $5. 


2 Proto-Bantu noun classes 


The natural starting point for this kind of study is the Proto-Bantu noun class sys- 
tem, both noun prefixes and (pronominal) concordial elements, which Meeussen 
(1967: 97) identifies as shown in Table 1.1 

On the basis of the reconstructions, we can make the following observations: 
(i) Noun prefixes all have L(ow) tone. (ii) Pronominal concord is H(igh) except for 
(shaded) classes 1 and 9 which are L. (iii) As indicated, some class pairings show 
some semantic consistency, e.g. 12, 13 and 19 are diminutive classes. In addition, 
class 6 mà- is also used for mass/liquids, and 16, 17, and 18 are locative classes. In 
short, at least 19 distinct noun classes can be reconstructed in PB. 


‘Abbreviations used in the tables are listed at the end of this paper. Vowels are transcribed using 
IPA symbols, rather than Meeussen's (1967) symbols: /i, 1, u, o/ rather than /i, i, u, u/. Note that 
we choose to reconstruct the concordial prefix of class 1 as *s- rather than to adopt Meeussen's 
(1967)’s *ju-, in which the j- might be a confusion with class 9. 
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Table 1: Proto-Bantu noun classes and genders 


Class NPfx Co Clas NPfx Co Sg./Pl. genders 
1 "me, Mo 11 *lo- *lo- 1/2 (humans) 
2 “bà-  “bé- 12 “kà- “ka- 3/4 
3 *moó- ^"go- 13 "o  "tó- 5/6 
4  *mr "et 14  *bó- "be 7/8 
5 *- "lr. 15 *ko-  *ko- 9/0 (incl. animals) 
6 *"mà-  "gá- 16 "pà- “pa- 11/10 
7 *ki- *gi- 17 "ko "ko 12/13 (diminutives) 
8 *bi- *bí- 18 "mz, "me 14/6 (abstract) 
9 *N [ME 19 nm "pt 15/6 
10 *N- *ji- 19/13? (diminutives) 
Table 2: Noun class reflexes in Teke (Ewo dialect) 
PB NPfx As | Prox Dist ‘two’ SPr SAgr/ C SAgr/ V 
1 o-,N-,@- wa!) wù wa — ndé Ø Ø 
2 à- bá bà bâ bvwóólè bó á bá 
5 le-,@ lé li là — ló lé lé 
6 à- má mà má  mbvwóólè mó á má 
7 kè- ké | ki kyà — kó ké ké 
8 è- bé bì bya  dziéélè jó é bé 
9 NC),@ yè yi ya  yiéélè yo é yé 
14 ò- bó ba bâ = ndé Ø Ø 
Table 3: Teke genders (sg./pl. pairings) 
Gender PB cl. Sg. pfx Pl. pfx Prox (sg./pl.)  # Semantics 
wa/ba 1/2 o N-, Ø- à wü/bà 79 animate, human 
wa/bé (1/8) ò- é- wu/bi 52 inanimate 
bó/bé (14/8) ò- è- bà/bi 18 abstract 
lé/má 5/6 le- à- li/ma 92 
lé/bá (5/2) lè- à- li/ba 1 ‘bird’ 
lé/yé (5/9) lè- N- li/yi 12 
ké/bé 7/8 kè- è- ki/bi 54 
yé/má (9/6)  N(-), Ø- à- yi/mà 45 
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3 Teke (Ewo dialect) 


The situation is quite different in Teke.? The forms found in the Ewo dialect are 
presented in Table 2. The PB class numbers in the first column are given for 
reference. Note that it is not clear whether the initial nasal consonant of class 9 


nouns should still be analyzed as a prefix in Teke, or whether it has fused with 


the root Ge, class 9 nouns do not have a prefix anymore). This will accordingly 


be shown in all transcriptions of class 9 nouns with a hyphen in parentheses 


between the potential nasal prefix and the root. 
From these tables the following observations and additional facts can be 
noted:? 


(i) 


(vi) 


Class 3 merged with class 1. We know that the form is historical class 1 
because of the L tone associative (only classes 1 and 9 had L tone concord 
in PB). 


Class 4 merged with class 8, thus producing a 1/8 gender (corresponding 
to PB 3/4). 


The Teke reflexes of PB class 9 is used both as a singular (with a class 6 
plural), but also as the plural of class 5 (from class Proto-Bantu class 11, 
see below), thus producing the two genders 9/6 and 5/9. Again, we know 
that the plural form is a reflex of class 9 because of its L tone associative 
(a reflex of Proto-Bantu class 10 would have a H tone). 


Class 11 merged with class 5. Its plural is now in class 9, hence a 5/9 gender 
(see above). 


PB diminutive classes 12, 13 and 19 and locative classes 16, 17 and 18 are not 
present in Teke. 


Of the eight singular/plural genders in Teke, those not occurring in PB are 
in parentheses (Maho 1999: 255-261). As seen, most genders are innova- 
tions (five out of eight), as schematized in Figure 1 (the numerals refer to 


? Although Teke languages often have seven (or more) phonetic vowels, e.g. Kukuya (Paulian 


1975), we did not find a phonemic contrast in the mid vowels, which we transcribe with e and 


0, pronounced as such in noun class markers and other grammatical morphemes, otherwise as 
[e] and [ə]. 

"Note, additionally, that the plural of class 14 nouns is now in class 8 instead of the original class 
6 of proto-Bantu, depsite the fact that class 6 still exists in Teke. We do not have an explanation 
for this change. One may surmise that the similarity of the singular ò- of 1/8 (from Proto-Bantu 


3/4), inanimate, may have caused an analogy. 
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the PB noun classes that the modern Teke classes correspond to histori- 
cally; the numerals in parentheses refer to the PB classes that merged with 


other classes). Examples of each gender follow in Table 4. 


Singular 
1(+3) wa 
14 bó 
7 ké 


5 (+11) lé 
9 


ye 


Plural 
ba 


bé 
ma 
ye 


Figure 1: Teke genders 


(+4) 


(<10) 


As can be seen in the shaded column of Table 2, at most eight distinct classes 
are recognizable, which we identify by their associative marker. The singular/ 
plural pairings (“genders”) are presented in Table 3, where the number (#) indi- 
cated for each gender is based on a lexicon of 356 singular/plural nouns. 

With this established, we now turn to consider how Teke derived from PB. 


Table 4: Examples of each Teke gender 


Teke PB Singular ` Plural 

wa/ba 1/2 mwáàánà  à-bà4nà ‘child’ 
ò-kúúlù à-kúúlù ‘uncle’ 
n-dzia a-ndzia ‘stranger’ 
n-dz06 a-ndzoo ‘elephant’ 

wa/bé (1/8  d-bâ e-bá ‘palm tree’ 
ó-mbónó ^ é-mbónó “leg 
ó-nywà é-nywà ‘mouth’ 
ó-kilà é-kilà ‘tail’ 

bó/bé (14/8) ó-yüáü é-yuu ‘poverty’ 
ó-dzá é-dzá ‘food’ 
ó-bvwóó ` é-bvóó ‘fear’ 
ó-nsámbá  é-nsámbá ‘judgment’ 

lé/má^ 5/6 lé-lémi à-lémi ‘tongue’ 
kélé à-kélé ‘stone’ 
dziini miini ‘tooth’ 
dziiri mbiiri ‘eye’ 

lé/bá | (5/2  ]lé-nyóní  à-nydni ‘bird’ 
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Teke PB Singular Plural 


lé/yé (5/9)  lè-nkii n(-)kii ‘neck’ 
lé-sálá n(-)tsálá ‘feather’ 
lé-ntséré n(-)tséré ‘straw’ 
lè-ndèli n(-)déli ‘beard’ 

ké/bé 7/8 kè-kài é-kài ‘hand’ 
ké-kala é-kala ‘mat’ 
ké-baa é-baa ‘wall’ 
ké-yirí é-yirí ‘bone’ 

yé/má (9/6)  n(-)dzó à-ndzó ‘house’ 
bi a-bi ‘egg 
n(-)dzálí à-ndzálí ‘river’ 
m(-)baa a-mbaa ‘fire’ 


4 From Proto-Bantu to Teke 


As summarized in §2, the noun classes inherited from PB have undergone a num- 
ber of mergers. PB classes 3, 4 and 11 all merged their noun and agreements with 
classes 1, 8 and 5, respectively. Class 14, on the other hand has merged its *bo- 
prefix with class 1 (and 3) 0-, but maintains a separate agreement. Similarly, PB 
class 2 *bá- and class 6 *mà- have merged their noun prefix as a-, but maintain 
distinct agreements. It is likely therefore that the noun prefixes merged first, and 
later their agreements. We survey these changes in this section. However, we 
first begin by considering the three genders that were inherited directly from PB. 
Table 5 presents examples of PB and later regional reconstructions, as well as 
their current reflexes in Teke.? 

As seen in Table 5, the major change has been the loss of the initial consonant 
of PB class 2 *ba-, class 6 *ma- and class 8 “br-. The Teke 5/6 examples show 
that class 5 nouns can be marked by lè- or Ø. Nouns in 5/6 are roughly equally 
divided; those in 5/9 always take lé-, since they all derive from PB class 11 *ks-. 


“Gender lé/má (5/6) contains a few nouns whose class can be identified through initial con- 
sonant alternation rather than the regular Teke prefixes lè- and d-, e.g. dziini/miini ‘tooth’, 
dziiri/mbiiri ‘eye’. Such words are rare, and constitute exceptional forms in the language (we 
have not been able to identify any phonological conditioning). Note that the plural of ‘tooth’ 
mbííri starts with a (historically unexpected) [mb] cluster rather than the expected [m]. 

5Proto-Bantu reconstructions are taken from Bastin et al. (2002) (noun roots), and Meeussen 
(1967) (noun class prefixes, cf. Table 1). 
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Table 5: Genders inherited from Proto-Bantu (Pfx-Noun + Assoc.) 


PB (sg./pl.) Teke (sg./pl.) 
“1/2 1/2 wa/ba 
*mo-kádró-  *bà-kädi bá- ò-kálí wà  à-käli bá ‘woman’ 
*mo-già ó- ^ "bà-già bá- ó-yiàwà = à-yiàbá ‘slave’ 
*5/6 5/6 lé/ma 
*j-jíco lí- *mà-jícó gá- dziiri lé mbiirima ‘eye’ 
"Ac li- *ma-jui gá- dzüi lé à-dzüi-má ‘voice’ 
*j-kájá lí- *mà-kájá gá- lé-káyàlé ^ à-káyà má ‘tobacco’ 
"7/8 7/8 ké/bé 
*ki-dibà gr- — *bi-dibà bí- ké-dià ké  è-dià bé ‘pool’ 
*ki-gàdá gi- *bì-gàdá bí- ké-kala ké — é-kàlá bé ‘mať 


While the above genders have been stable, four class mergers directly explain 
two of the new genders: The first, gender 1/8 wa/bé, is the formal merger of *3 > 
1 and *4 > 8. Thus, as seen in Table 6, PB 3/4 now corresponds to Teke 1/8 wa/bé: 


Table 6: Gender wa/bé (*3/4 > 1/8) 


PB (sg./pl.) Teke (sg./pl.) 
*3/4 1/8 wà/bé 
*mo-noà gó- *mi-noà gi- ó-nywà wä é-nywabé ‘mouth’ 
*mo-gonda gö-  "mi-gondà gr- ó-küünà wà  è-küünà bé ‘field’ 
*mo-kida gù- *mi-kidà gr- ó-kílà wa é-kila bé "tail 
*mo-trmà gò- *mi-trmà gr- d-timà wa é-tímà bé ‘heart’ 


Similarly, gender 5/9 lé/yè derives from the merger of *11 > 5 and *10 > 9 (Ta- 
ble 7). 


*The same *10 > 9 merger seems to have occurred in Latege, another B71 dialect (Ruth Raha- 
rimanantsoa and Pauline Linton, p.c.), but not in Kukuya (Paulian 1975), Ngungwel, or Eboo 
(Ruth Raharimanantsoa p.c.). This merger thus seems to be a characteristic of B71 dialects only. 
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Table 7: Gender lé/yè (*11/10 > 5/9) 


PB (sg./pl.) Teke (sg./pl.) 
*11/10 5/9 lé/yè 
*Jo-dédüló- *N-dèdù yi lé-ndélilé  n(-)dèli yè (*yé) ‘beard’? 
*Jo-cádálo- ^ *N-cada yi lè-sálá lé ^ n(-)tsálá yè (“yé) ‘feather’ 


“Note that the initial [n] in the singular form lé-ndéli might be an indication that the singular 
was also initially in class 9, and then reassigned to class 5. 


As discussed above, these mergers appear to be the result of regular sound 
changes affecting noun prefixes, e.g. the systematic loss of prefix-initial [b] or 
[m] (prefix-initial [1] and [k] are not affected), followed by the realignment of 
agreement patterns, as illustrated in Table 8 for the *4 > 8 merger:’ 


Table 8: Hypothesized steps of *4 > 8 merger 


( 8 è-kidà bé 


4 *mi-kidà gr- > 41-kida gr "E 
(.)»  8é-dià bé 


8*bi-dibà bí- >  81-dibà bí- 


> 
> 


The origin of 5/9, and 9/6 can be traced back to class/gender reassignment 
following the consequences of the “10 > 9 merger, i.e. the loss of a number dis- 
tinction for N- initial nouns. This again shows the importance of a prior prefix 
merger in motivating changes in noun class assignments. Former *9/10 nouns 
could have become a number-insensitive 9/9 gender, but did not. Instead, the "10 
> 9 merger led to a class/gender reassignment based on the semantic property of 
animacy. Animate “9/10 nouns were reassigned to 1/2 wa/ba, merging with the 
human nouns in that class, as shown in Table 9.8 

As also seen in Table 9, one noun, lè-nyòní ‘bird’, shifted into gender 5/2 lé/bá. 
Note that this is the only noun illustrating both this shift, and the inquorate 


The fact that only prefix-initial labial consonants are targeted does not necessarily contra- 
dict the Neogrammarian principle of sound change regularity: stem-initial prominence, which 
plays an important role in Teke and more generally Northwestern Bantu languages (cf. Pau- 
lian (1975), Hyman (1987), Idiatov & Van de Velde (2016), a.o.) is not unlikely to have protected 
stem-initial consonants (and possibly other non-prefix consonants?) from this change. A de- 
tailed account of such sound changes in Teke is, however, outside the scope of this paper. 

*Most of the former class 9 nouns that were reassigned to class 1 or class 5 start with a nasal 
consonant, which is a trace of the former nasal prefix of class 9, now part of the root. We have 
indicated the historical origin of this nasal consonant as a prefix with a hyphen in parentheses. 
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gender 5/2 lé/bá. On the other hand, inanimate *9/10 nouns either became 9/6 
yé/má (plural reassignment only) or 5/6 lé/má (complete gender reassignment), 
as shown in Table 10. 


Table 9: Animate “9/10 > 1/2 wa/ba (+ 1 case of 5/2 lé/bá) 


PB (sg./pl.) Teke (sg./pl.) 
*9/10 1/2 wa/ba 
*N-nàmà yi ` *N-pama yí- nyàmà wà à-nyàmà bá ` ‘animal’ 
*N-jógo yi- *N-jogo yí- n(2dzóó wa à-ndzóó ba ‘elephant’ 
*N-boà y1- *N-bóà yí- m(-)bvà wà à-mbvà bá ‘dog’ 
*N-gombé yi-  "N-gómbe yí- n(Jgómbé wà a-ngdmbé ba ‘cow’ 
*N-gàndo yi- — *N-gàndo yí- n(Jgàndíwà à-ngàndíbá ‘crocodile’ 
*N-gói yi- *N-gói yí- n(Jgó wà à-ngó bá "leopard' 
*N-gombá yi-  *N-gombá yí- n(Jgüümà wa  à-ngüümà bá ‘porcupine’ 
*N-kímà yi- *N-kímà yí- n(-)kima wa a-nkima ba ‘monkey’ 
*N-gubs yi- *N-gübo yi- n(Jgübü à-ngübü bá ‘hippo’ 
*N-poko yi- *N-poko yí- m(-)puu wa à-mpüü ba ‘rat’ 
*N-pidi yi- *N-pidi yí- m(-)pili wa a-mpili ba ‘snake sp: 
*N-cüi yi- *N-cüi yí- n(-)tsû wa à-ntsü bá ‘fish’ 
5/2 lé/bá 
*N-joni y1- *N-joni yí- lé-nyóní lé à-nyóní bá "bird 
Table 10: Inanimate *9/10 > 9/6 yé/má or 5/6 lé/má 
PB (sg./pl.) Teke (sg./pl.) 
*9/10 9/6 ye/ma 
*N-jàdà yi- *N-jàdà yí- n(-)dzala yé a-ndzala ma ‘hunger’ 
*N-jongo yi- — *N-jongo yí- n()dgüngü yè  à-ndgüngü mä ‘pot’ 
*N-jidà yi- *N-jidà yí- n(-)dzila yé à-ndzilà má ‘path’ 
5/6 lé/ma 
*N-gi yi- N-gi yi- lè-ngingi lé à-ngingi má ‘fly’ 
*N-dongo yi- | N-dóngo yí- lè-ndúú lé a-nduu ma ‘pepper’ 
*N-tódó yi- N-tódo yí- lé-ntálà lé à-ntülü má ‘chest’ 


In addition, a few “9/10 inanimate nouns became either 1/8 wa/bé or 5/9 lé/yé, 
as can be seen in Table 11. 
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Table 11: Inanimate “9/10 > 1/8 wd/bé or 5/9 lé/yè 


PB (sg./pl.) Teke (sg./pl.) 
*9/10 wa/bé 
*N-gòdí yi- ^ N-godi yí- ò-ngòrí wà è-ngòríbé ‘liana’ 
lé/yè 
*N-jogo yi- N-juge yí- lè-ndzú lé n(-)dzú yè = ‘groundnut’ 
*N-kíngó yi- N-kíngó yí- lè-nkíí lé n(-)kii yè ‘neck’ 


Finally, a few former “9/10 nouns alternate between 5/6 lé/má and 5/9 (< *10) 
lé/yè, e.g. lé-mpàmbü lé / à-mpämbü ma ~ m(-)pàmbü yè ‘worm’. Note that all 
former “9/10 nouns reassigned to 1/2 wa/bd, 5/6 lé/má, 9/6 yé/má, 1/8 wa/bé, or 
5/9 lé/yé have kept the historical N- prefix. The cause of all of the above *9/10 
class/gender reassignments is presumably the need to maintain a singular/plural 
distinction, with animacy exploited as the guiding criterion for reassignment. 

Animacy plays a potential role in other places in the Teke noun class sys- 
tem. Recall that singular nouns marked by the prefix ò- today may represent 
the merger of PB *1 and *3 (with the same agreements) or class *14 (with its dis- 
tinct agreements). From the semantics one can almost perfectly predict whether 
an à-prefixed noun will be in gender 1/2, 1/8 or 14/8. As before, animate nouns 
will all be in 1/2. Inanimates will either be in 14/8 bó/bé if they represent an ab- 
stract quality (as in PB *14), otherwise in 1/8 wa/bé. Representative examples are 
provided in Table 12. 


Table 12: Animacy-based gender assignment of ò- nouns 


Animate — 1/2 wa/ba 

ó-lümi ‘husband’ — o-limiwa / a-lumi ba 

d-tèé ‘Teke person’ — à-tèé wá / à-tèé ba 
Abstract — 14/8 bó/bé 

ó-bvwóó ‘fear’ — ò-bvwòóbó /  é-mbvwóó bé 

ó-yüü ‘poverty’ —  óOó-yüü bó /  é-yáü bé 
Concrete inanimate — 1/8 wa/bé 

ó-bá ‘palmtree’ — 06-bawa / é-bábé 

o-sia ‘rope’ — o-siad wa /  é-sià bé 
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Table 13 shows the number of animate nouns that occur in each gender. 


Table 13: Genders and animacy [bracketed number = nouns with hu- 
man referent] 


Animate [incl. human] Inanimate Total 
12 wa/ba '78[84] 6 79 
5/2 lé/bá 1 0 1 
18 wa/bé 5 [1] 47 52 
14/8 bó/bé 0 18 (abstract) 18 
5/6  lé/má 9 (insects, ‘frog’, ‘tortoise’) | 83 92 
5/9 lé/ye 2 10 12 
7/8 ké/bé  11[5,kin] 43 54 
9/6  yé/má 0 45 45 

Total: 353 


As seen in Table 13, the total number of animates is 101 out of 353 total nouns. 
Of these 101, 73 occur in 1/2 wd/bà. In fact, virtually all humans are in 1/2. Of 
the rest only 28 animate nouns occur outside 1/2. Interestingly, no animate noun 
has a class 9 yè singular (gender 9/6 yé/má). All PB “9/10 animate nouns were 
reassigned, mostly to 1/2 wa/ba. 

To conclude this section, we note with considerable interest the variation in 
former *11 (and some *9) nouns that have been reassigned to class 5 lé. These have 
kept the former class 10 plural N- form, even though it has the L tone agreement 
yè of *9. However, as we have noted, an N- noun is ambiguous in terms of number, 
and may be interpreted either as singular or as plural. It can be the plural class 
9 of a class 5 lé singular (from *11) or the singular of 9/6 yé/má and 1/2 wá/bá. In 
addition, nearly half of class 5/6 nouns alternate between a prefixed lè- and a Ø 
or N- singular form, approximately half in our lexicon occur without /é-. (Recall 
that all 5/9 nouns require lè- on their singular.) This is illustrated in Table 14. 


Table 14: lè- vs. Ø sg. prefix in 5/6 lé/má nouns 


Optional Ø 
(lè-)mpéi lé /  à-mpéimá ‘chin 
(lé-)sàánilé /  à-sàánimá ‘plate’ 
Obligatory 2 
("lé-)kfürülé /  à-kfürü má ‘hole’ 


(“lé-)bila lé /  à-bilà má ‘leprosy’ 
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As a result this has produced several cases where the same N- noun can be 
interpreted as either singular or plural, paired with an appropriate noun class of 
opposite number, as shown in Table 15. 


Table 15: N- nouns as singular or plural 


Plural class 9 yé (singular = class 5 lé) 
lé-mbàlàlé / m(-)balayé ‘civet cat’ 
lè-nkii lé /  n(-)kii yè ‘neck’ 

Singular class 5 lé (plural = class 2 bá if animate) 
m(-)bàlàlé  /  à-mbàlà bá ‘civet cat’ 


Singular class 5 lé (plural = class 2 má if inanimate) 
n(-)kii lé / à-nkii má ‘leprosy’ 


5 Conclusion 


As seen above, an identical prefix shape can not only lead to merger of noun 
classes (e.g. class *1 and “3, *4 and *8, *5 and *11), but can cause a noun to function 
in two different genders, one as a singular, the other as a plural. This too can be 
expected to lead to further realignments as the noun classes prepare for their 
next move. 

In the preceding sections, we have seen that the Teke noun class system has 
undergone important restructuring with loss of eleven of the nineteen PB classes, 
four class mergers, and many gender reassignments. As we have shown, only 
three out of eight genders are inherited from PB. Three variables have played 
an important role in this evolution: (i) prefix shapes; (ii) animacy; (iii) number. 
Number and animacy have played a major role in this restructuring, in particular 
in the class and gender reassignment of PB *9/10 nouns. These become 1/2 waá/bá 
if animate, 5/6 lé/má or 9/6 yé/má if inanimate (occasionally also 1/8 wa/bé and 
5/9 lé/yé). Animacy also plays an important role in synchrony. As we have shown, 
singular ò- is interpreted as 1/2 wàá/bá if animate, 14/8 bé/bé if abstract, and 1/8 
wa/bé if concrete inanimate. In addition, a noun with the prefix sequence à-N- is 
unambiguously class 2 bá if animate, class 6 má if inanimate. The relevance (and 
potential conflict) of animacy in the synchronic and diachronic marking of noun 
classes is attested elsewhere in Bantu (Wald 1975; Maho 1999; Contini-Morava 
2008; among others), even to the extent of entirely replacing the inherited noun 
class system, as in Nzadi (Crane et al. 2011). What is particularly interesting in 
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the Teke case is the conspiracy between prefix shape and animacy. Noun classes 
are reassigned on the basis of animacy. As prefixes merge, noun class agreements 
merge, even those accompanying singular and plural 9/10. This shows that Teke 
speakers are paying attention not only to the semantics, but are impressively 
influenced by the forms. Such interplay in the reassignments which we have 
enumerated should be considered in probing parallel noun class changes in other 
Northwest Bantu and Niger-Congo in general. 


Abbreviations 

As Associative Prox proximal demonstrative 
Co concord ‘this/these’ 

Dist distal demonstrative ‘that/those’ SA Subject agreement 
NPfx noun prefix SPr Subject pronoun 
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Chapter 6 


Tone, orthographies, and phonological 
depth in African languages 


Michael Cahill 


SIL international 


Marking of tone in African orthographies has historically been a challenge, not 
only for linguistic and analytical reasons, but also because most designers of these 
orthographies have been educated in non-tonal languages. After a review of lexical 
vs. grammatical tone, this paper examines various strategies that have been used 
for marking both lexical and grammatical tone in several East and West African 
languages, as well as cases in which tone is not marked. The question of the de- 
sired phonological depth of an orthography is discussed, especially when applied 
to tonal processes. Many phonologists do not apply theory more recent than Chom- 
sky and Halle & Chomsky (1968) to orthographies. However, the more recent bi- 
furcation of rules into lexical and postlexical provides a psycholinguistically sup- 
ported phonological level at which tone marking can be based: the output of the 
lexical level. Experimental evidence supports this LEXICAL level as more readable 
than either a PHONEMIC or a DEEP level. A tonal typology of languages also guides 
what types of languages more predictably would need lexical tone marking. Rec- 
ommendations for orthographical implementation are given in the conclusion. 


1 Introduction 


Marking of tone in African orthographies was considered problematic even be- 
fore the 1928 Rejaf Language Conference, where permission was rather grudg- 
ingly given to mark tones in Sudanese languages when absolutely necessary: “For 
tonal representations, the consensus was that only high tones should be marked, 
with an acute accent, and only if necessary for a particular language” (Miner 
2003). 

One reason for this rather tepid approval was that most developers of orthogra- 
phies either were Europeans or were educated in European languages, which 
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of course are not tonal. The result was that many writing systems for African 
languages avoided tone marking, and tone was often not studied in any depth. 
Matters improved only somewhat two years after Rejaf with a cross-continental 
proclamation: 


In books for Africans, tones, generally speaking, need only be marked when 
they have a grammatical function, or when they serve to distinguish words 
alike in every other respect; and even then they may be sometimes omitted 
when the context makes it quite clear which word is intended. As a rule, it 
will suffice to mark the high or the low tone only. (International Institute 
of African Languages and Cultures 1930: 14, referring to Rejaf and 12 other 
documents) 


This guidance sounds strikingly modern, both in what it says and does not say. 
Note that this statement specifies books “for Africans,” not for foreigners, so it 
primarily has local literacies in mind. It laudably distinguishes grammatical from 
lexical tone, and for the latter, advocates what is called “selective tone marking” 
today - marking tone only on minimal pairs, and even then, only when they 
are words likely to be confused in context. Tone marking is still considered a 
challenge today. It is not uncommon for orthography developers to not mark 
tone at all, either for principled reasons, or because they cannot deal with it, 
or because they do not consider it important (see Cahill 2000 for a critique of 
omitting all tone markings). 

This paper begins (82) with a review of the distinction between lexical and 
grammatical tone. $83 examines methods that have been used to represent both 
lexical and grammatical tone (or not) in various African orthographies. In $4, I 
examine two major topics for assisting decisions in tone marking: the appropri- 
ate phonological level for orthographies, and a two-fold typological division of 
African languages. I close in $5 with some recommendations for representing 
tone in African orthographies, and a brief re-examination of the selective tone 
marking issue. 


2 Lexical vs. grammatical tone: Review 


Lexical tone is a difference in pitch that distinguishes one lexeme from another. 
Samples of this are given in (1).! 


1I follow a common notation for tone transcriptions that indicates tone levels with various 
diacritics: 4 = high, à = low, à = mid, à = falling, 4 = rising, and 'á = downstepped high. An un- 
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(1) Lexical tone differences in nouns 


a. Konni (Cahill 2007: 306) 


kpaan kpa'an kpaan 

‘oil’ ‘guinea fowl’ ‘back of head’ 
b. Mono [mnh] (D. R. Congo) (Olson 2005: 198) 

awa awa awa 

‘diarrhea’ ‘road’ ‘fear’ 


Grammatical tone, on the other hand, distinguishes one grammatical category 
from another. There are many grammatical categories which can be thus distin- 
guished. Some of the more common ones are given in (2-6) and Table 1. Not 
every person distinction is differentiated by tone in these or other languages; it 
is typically only two pronouns of the set that are so distinguished. 


(2) Person distinguished by grammatical tone 
a. Jur Modo [bex] (Sudan) (Persson 2004: 80) 


ni ni 
‘her’ ‘their’ 
b. Lyele [lee] (Burkina Faso) (Kutsch Lojenga 2014: 57) 
ñ n 
2SG 3SG 


In some languages (e.g., Tarok in Table 1), tone distinguishes singulars from 
plurals in only a subset of nouns, while in others (e.g., Koro Waci, Ndrulo), tone 
change is the normal method of making plurals from singular nouns. It appears 
that in the majority of languages which exhibit tone change to mark plural nouns, 
the plural nouns are in some way higher toned than the singular. However, this 
is not universal, as will be seen in Karaboro in section §3.3. 

Though verbal aspect may be the most common grammatical category distin- 
guished by tone, as in (3), other categories are not rare. (4) shows an example of 
tone distinguishing a locative from the bare noun, (5) exemplifies the syntactic 
subject/object feature distinguished solely by tone, and (6) exhibits a miscellany 
of language-specific grammatical relations distinguished by tone. 


marked tone is generally mid in a 3-level system. Unless indicated by other labeling, phonetic 
transcriptions are enclosed in square brackets [a], while orthographic representations are in 
angle brackets (a). ISO codes for languages are noted in the usual square brackets, e.g., [kma] 
for Konni in (1). 
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Table 1: Singular/plural nouns distinguished by grammatical tone 


Singular Plural Gloss 
a. Ndrulo [led] (Uganda) (Kutsch Lojenga 2014: 60) 
vini vini ‘his sister/s’ 
djani djani ‘his father/s’ 
b. Koro Waci [bqv] (Nigeria) (Rachelle Wenger, p.c.) 
sor isór ‘he-goat/s’ 
Ttómi Ttómi ‘work/s’ 
ibür ibûr ‘slime/s’ 
c. Tarok [yer] (Nigeria) (Longtau 2008: 90-91) 
ifang ifang ‘fingers/s’ 
ina ina ‘cow/s’ 
ntüng ntüng ‘hyena/s’ 
(3) Verbal aspect distinguished by grammatical tone 


(4) 


(5) 
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Mbembe [mfn] (Nigeria) (Barnwell 1969) 
5k5n “you sang’ 2kän ‘you should sing’ — mécht ‘he will eat’ 
dkon ‘you have sung’ ók 'ón ‘if you sing’ mochi ‘he will not eat’ 


Locative distinguished by grammatical tone 
Fur [fvr] (Sudan) (Kutsch Lojenga 2014: 61) 


baru ‘country’ bard ‘in the country’ 
dirá ‘hand’ dórà ‘in the hand’ 
ota ‘fire’ dtd ‘in the fire’ 


Subject/object relations distinguished by grammatical tone 
Sabaot [spy] (Uganda) (Kutsch Lojenga 2014: 66) 


krbakaac kwaan ‘his father left him’ 
krbakaac kwáán ‘he left his father’ 


Other relations 

Lugungu [rub] (Uganda) (Moe & Mbabazi 1999: 10) 
mulogo muhandü ‘an old witch’ 

mülógó muhandá ‘the witch is old’ 

mülógó muhandü ‘the witch, (she) is old’ 


6 Tone, orthographies, and phonological depth in African languages 


3 How tone is marked 


Local orthography developers and outside linguists have developed astonish- 
ingly varied and sometimes creative ways of marking tone in languages. In con- 
trast, some languages do not mark tone at all, even if they are distinctly tonal, 
and I start with these. 


3.1 No tone marking 


Here I look at a few languages with no orthographic tone marking at all. Inter- 
estingly, sometimes tone marking appears to be crucial to reading, and in other 
cases less so. 

The consensus among linguists I have spoken to is that the common way of 
writing Hausa in (7) (there are other systems) is quite difficult to read. This is 
especially due to the fact that the grammatical tone, as in the example, is not 
marked, and there are many situations where this ambiguity is impossible to 
resolve by the context. 


(7) Various verbal aspects 
Hausa [hau] (Nigeria) (Harley 2012) 


[jaa tafi] [jaa tafi] [ja tafi] 
<ya tafi) <ya tafi) (ya tafi) 
‘he went’ ‘he may go’ ‘he should go’ 


Kumam [kdi] (Uganda) has both lexical and grammatical tone: abe can mean 
either ‘an egg’ or ‘a lie? while ebedo can mean either ‘he lives’ or ‘he lived? How- 
ever, tone is not marked at all in Kumam, and 60% of people surveyed agreed 
that it is more difficult to read the Kumam Bible than Bibles in other languages 
(Edonyu 2015). 

Konni [kma] (Ghana) orthography does not mark tone. However, in contrast to 
the above languages, this seems not to make a significant difference in readability 
(my personal observation). In this language minimal pairs are few, so there is 
a fairly small functional load for lexical tone. Furthermore, there is very little 
grammatical tone in the language. People are able to read aloud fluently. 


3.2 Marking lexical tone 


Lexical tone, if it is marked, is marked by diacritics more frequently than not. 
Rangi [lag] of Tanzania, for example, marks lexical High tone, but only on nouns 
(e.g., iküfa ‘bone’, Stegen 2005). Similarly, Akoose marks High tone (e.g., edib 
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[èdib] ‘river’) and contours (e.g., kod [kod] age, but leaves Low unmarked 
(Hedinger 2011: 13). 

In a few cases, tone has been marked by punctuation marks before each word, 
especially in Cóte d'Ivoire (e.g., Bolli 1978). Examples of the punctuation marks 
used are displayed in Table 2. 


Table 2: Lexical tone notation for Cóte d'Ivoire languages (Kutsch Lo- 
jenga 2014: 58) 


extra high mid low extra mid- low- high- 

high low low high low 
falling rising falling 

"CV ‘CV CV -CV =CV CV- -CV’ ‘CV- 


This system can handle up to five tone levels, necessary in some languages of 
Cóte d'Ivoire. This is exemplified as follows in Attié, which has four contrastive 
levels of tone (but does not have extra low). 


(8) Attié (Matthew 6:30a) 
"Pete "yi "fa, fa "kan'a 'ló "a -bë ko fon- 'tshen'a to, 'eyipian -Zó -wo' sen 


'e hen dzhi ko ... 


3.3 Marking grammatical tone 


Different languages have used a wide variety of strategies for indicating gram- 
matical tone. One strategy is using diacritics, and often these mark a phonetic 
tone which instantiates a particular grammatical category, as in (9), with the 
Daffo variety of Lis Ma Ron. 


(9) Diacritic showing both phonetics and meaning 
Lis Ma Ron [cla] (Nigeria) (Harley 2012) 
á à 
'you (male)' ‘he’ 
Akoose exhibits a somewhat unusual pattern in that the singular and plural 


nouns for class 9/10 are identical, but the distinction is made by tone on the 
agreement prefix of the following verb: 
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(10) Akoose (Hedinger 2011: 13) 
a. [ngu: edélé] 
nguu edélé 
'the pig is heavy? 
b. [ngu: édélé] 
nguu édélé 


'the pigs are heavy' 


Some languages indicate grammatical tone by letters which are otherwise un- 
used. For example, Gangam [gng] (Togo) marks grammatical tone, not phoneti- 
cally, but with other symbols to indicate the meaning. The imperfective is marked 
with the letter 4h» and the perfective with an apostrophe €> (See Higdon et al. 
2000, also Roberts et al. 2013 for more examples. Phonetic transcription is from 
Jean Reimer p.c.). 


(11) Gangam (Higdon et al. 2000) 
a. N benge' [bénge] Miganganm ya kaanm. 
‘I learned to read Gangam: 
b. N laan bengeh [béngé] Miganganm ya kaanm ne. 


‘I am learning to read Gangam: 


Similarly, Etung (12) uses <h) to differentiate pronouns which differ only by 
tone. 


(12) Pronouns in Etung [etu] (Nigeria, Harley 2012) 


a. [á] b. [à] 
(ah) (a) 
‘they’ ‘he’ 


Other languages double some letters to differentiate pronouns which differ 
only by tone (13). 


(13) Pronouns in Jur Modo [bex] (Sudan, Persson 2004) 


a. [ni] b. [ní] 
(ni) Anni) 
‘her’ ‘their’ 
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A number of languages indicate various grammatical tone functions by means 
of punctuation or other non-alphabetic marks. Karaboro, as displayed in (14), uses 
a word-final hyphen to indicate plurals (in those cases which are not indicated 
by a segmental marker), which all happen to end in a low tone. 


(14) Plurals in Karaboro [xrb] (Burkina Faso, SIL 2009, as cited in Roberts et al. 


2013) 

a. [kai, kai] b. [gjos, gjoo] c. [sààpjé, sàápjé] 
«kai, kai-) (joo, j99-) (saapye, saapye-) 
‘affair, affairs’ ‘net, nets’ ‘rabbit, rabbits’ 


The old Ejagham orthography, now changed to a different system, used punc- 
tuation extensively to indicate various verbal aspectual forms (3). 


Table 3: Old Ejagham orthography [etu] (Nigeria & Cameroon) 
(Bird 1999a, corrected by John Watters, p.c.) 


Orthographic Rule Phonetic Orthography Gloss 

colon = PERFECT [émè] (e:me) ‘we have swallowed’ 
space = PERFECTIVE [emé] (e me) "we swallowed' 
apostrophe = HoRTATIVE [éme] (e'me) ‘let us swallow’ 
hyphen = CONDITIONAL  [émé] <e-me) ‘when we swallow’ 
no symbol = NouN [èmè] (eme) ‘neck’ 


The Bokyi orthography (Table 4) uses a system that appears rather unusual to 
most readers in its employment of a variety of non-alphabetic symbols, but it is 
currently in use. 

The Bungu language is one of the more complex illustrations of grammatical 
tone marking. It uses both diacritics and punctuation marks to indicate the inter- 
action of person and aspect in the verbal system. Many words are segmentally 
identical, and vowel length is not contrastive, putting a greater load on tone. At 
this point, no lexical tone is marked (though the orthography is still being ad- 
justed), and Table 5 does not give the entire picture of grammatical tone. Other 
complexities exist as well, such as tone marking of objects. 


3.4 Marking both lexical and grammatical tone with diacritics 


Zinza [zin] (Echizinza) marks both lexical and grammatical tone, with accent 
marks for high, rising, and falling (see 15). 
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Table 4: Bokyi [bky] (Nigeria) orthography (Harley, p.c.) 


Phonetic Orthography Gloss 

[ntsé] (nce) ‘going’ 

[ntsa] (n-ca) Igo 

[ntsé] (n/ce) ‘I went? 
[nñtsè] (nn/ce) ‘I have gone’ 
[ntfi n-tsa] (n/chi n-ca) ‘Iwill go’ 
[nnitséé] <n*-ce*) ‘I don’t go’ 
[ndatséé ] <n*da/ce*) ‘I didn’t go’ 
[mrhbatfi ntsaa] <n*ba/chi n-ca*) ‘T will not go’ 


Table 5: Bungu [wun] (Tanzania, Katterhenrich & Gray 2016) (Low tone 
is unmarked). Key: Colon: COMPLETIVE; Carat: PROGRESSIVE; Umlaut: 
2SG.SUBJ.PAST; Accent: 3SG.SUBJ.PAST; Double vowel: 3PL.SUBJ.PAST. 


Orthography Gloss 

(wákala) [wàkála] ‘you bought (recent) 
(wákala) [wákála] ^ "he bought (recent) 
(waakala) [wa:kala] ‘they bought (recent)' 
(wakala) [wakála] ‘they will buy’ 

(^wakala) [wa kala] ‘they are buying’ 
€wákala)  [wäkala] ‘you have already bought 
€wákala)  [wakäla] ‘he has already bought’ 
{:waakala) [wä:kala] ‘they have already bought’ 
«:nakala) [nakala] ‘T have already bought’ 
(nakala)  [näkala] ‘T am buying’ 
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(15) Marking both lexical and grammatical tone with accent in Zinza 
(Matthews 2010) 


a. lexical tone 


enzoka omuyánda 
‘snake’ ‘child, youth’ 
b. grammatical tone 
aleeba aléeba 
‘he looked’ ‘he (habitually) looks’ 


4 Phonological theory and orthography 


The above discussion has assumed that tones are completely stable, i.e., that un- 
derlying tones and surface tones are the same. The question of when or if to 
mark the results of tone rules offers more challenges. For example, in a Bantu 
language, if a prefixal High tone spreads for three syllables, does one mark the 
initial prefix syllable alone, or the result of the spreading rule? Or, in west Africa, 
if underlying tones in a word are /HLH/, but surface as [H'HH], what is the ap- 
propriate marking? The major question that involves both of these situation is: 
what depth of phonological representation should be the basis for marking tone? 
This section addresses those questions. 

Tone studies have advanced in the decades since the 1928 Rejaf conference, es- 
pecially with Autosegmental Phonology (Goldsmith 1976) and Lexical Phonology 
(Pulleyblank 1986). However, as Snider (2014) notes, many people do not apply 
phonological theory more recent than Halle & Chomsky (1968) to orthographies. 
Rather, the main distinctions that most orthographers have in mind are “deep” 
vs. “shallow” orthographies. However, as we will see, there are other options. 

A shallow orthography is close to or identical with the surface pronunciation, 
after most or all of the rules have applied. This has certain consequences and 
raises the following issues. 


* The same word will appear with different tone marks depending on its con- 
text. A "constant word image" (useful for quick word recognition) is not 
maintained. 


e It tends to be cumbersome and hard to read. Bird (1999b) showed that an ex- 
haustive shallow tone marking was actually less readable than no marking 
in Dschang. 


* How are multiple downsteps represented, when the tone can have several 
decreasing phonetic levels? 
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A deep orthography represents the sounds before the rules have applied. Very 
broadly, this is what linguists think of as the “underlying form” This also has 
certain consequences. A deep orthography has certain characteristics. 


e It retains a constant word image, aiding quicker visual recognition of a 
word; 


e It can sometimes be adapted better across dialects, since dialectal differ- 
ences can be attributed to varying rule application; 


e It can be significantly different than any person's actual pronunciation, 
including pronunciation in isolation. 


If a particular language has few tone processes, there will be little or no dif- 
ference between a shallow and deep orthography. The above does not exhaust 
the possibilities; Bird (1999a) and Roberts et al. (2013) give a number of other 
variations on marking tone. 


4.1 Lexical phonology as a useful framework 


Ihave mentioned “rules,” but what kind of rules do I mean? There is a rich history 
of types of rules and their interactions, and one would expect that a narrowing 
of types of rules would likely be helpful in determining tone orthographies. And 
so it is. 

Lexical Phonology (e.g., Pulleyblank 1986) is now disfavored as a comprehen- 
sive phonological theory, but the notion of lexical vs. postlexical processes is still 
invoked in contemporary theories such as Stratal Optimality Theory (Kiparsky 
2000; Goldsmith et al. 2014). I argue that Lexical Phonology offers a level of psy- 
cholinguistic realism that is helpful in determining which level to refer to in 
deriving orthographic representations. 

In Lexical Phonology, the output of the lexical level is the psychologically real 
level. This level is similar to but not precisely the same as the “phonemic level” 
of earlier theories. Following Snider (2014), I propose that this is the most appro- 
priate phonological level for orthography in general. Specifically for this paper, 
it is proposed that this level is the most fruitful level in applying the results of 
tone rules to an orthography. 

Snider (2014) is a major advocate of the above. One does not need to adopt 
the entire theory of Lexical Phonology to profit from its main benefits. The main 


113 


Michael Cahill 


question in dealing with a phonological rule that may make a difference in or- 
thographic representation is whether a rule is lexical or postlexical. Several diag- 
nostic questions can be fruitfully applied to determine this, which I have adapted 
with minor modification from Snider (2014). These questions are: 


* Are there lexical exceptions to the process? 
* Does a given process lack phonetic motivation? 


* Does the process have to apply across a morpheme boundary? (not a word 
boundary) 


If one or more answers to the above are “yes,” then the rule is a lexical rule; 
write the output of that rule. Other diagnostic questions: 


e Is the new sound the rule produces a non-contrastive sound in the lan- 
guage? 


e When a given process has applied, do native speakers think that the sound 
that results is the same as the sound that underwent the process? 


* Does the process apply across word boundaries? 


If one or more answers to the above are “yes”, then the rule is postlexical; write 
the sound at the level before the rule applies. Also, if there is no apparent reason 
to categorize a rule as lexical, Snider advises assuming it is postlexical. 


The above questions are a starting point for tentative decisions that should 
be held somewhat loosely; all orthographic decisions need to be actually 
tested.? 


The experimental evidence from Kabiye (Roberts et al. 2016) on two tone pro- 
cesses which were marked differentially in test orthographies supports this. The 
authors tested what they termed the Lexical Orthography Hypothesis, that is, that 
the lexical level (i.e., the output of the lexical phonology) offers the most promis- 
ing level of phonological depth upon which to base a phonographic tone orthog- 
raphy that marks tone exhaustively.’ 


?Gudschinsky (1958: 342-343) gives an interesting example of a Mazatec man (Mexico) who was 
quite aware of the results of tone processes within words (lexical rules), but insisted that the 
tones of two particular phrases were different, though they were phonetically tonally identical 
(result of postlexical rules). 

“Exhaustive tone marking" is marking the tone on every syllable. There is reason to believe 
that this is not the most effective way to mark tone, but it was adopted for the purposes of 
having a more controlled experiment. 
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They tested 97 tenth-graders with orthographies that represented two tonal 
processes in three different ways. The rules were: 


Lexical rule of L-spread: in the Kabiye verb, the L tone of a prefix spreads right- 
wards onto a H verb root until it is blocked by a singly linked H tone. This 
is shown to be lexical because it applies only across a specific morpheme 
boundary and is limited to within a word. Results of this rule are illustrated 
in Table 6. 


Postlexical rule of HLH plateauing: a singly linked L between two H tones delinks, 
and the second H spreads left and has a downstepped register. This is 
shown to be postlexical by the fact that it applies across word boundaries 
as well as within words. Results of this rule are illustrated in Table 7. 


The researchers tested 3 orthographies: 


Phonemic: the pronunciation minus application of any allophonic processes 


Lexical: (output of lexical phonology) the phonemic level minus application of 
any postlexical processes 


Deep: (input of lexical phonology) the lexical level minus application of any lex- 
ical processes, a morphographic representation. 


Examples of the orthographic output of these different systems are shown in 
Tables 6 and 7. 

Note that because of the specific processes chosen, the results of the Low- 
Spread rule distinguish a Deep orthography from the others. Is the Deep or the 
Lexical/Phonemic representation better? The results of the HLH Plateauing rule 
distinguish the Phonemic orthography from the others; is the Phonemic or the 
Deep/Lexical representation better? Table 8 shows the expected results if the 
Lexical Orthography Hypothesis is correct. Note that the experiment focused on 
the oft-neglected domain of writing as well as reading. 

The reader is referred to the paper for full results, but on the whole, the Lexical 
Orthography Hypothesis was supported. Lexical and Phonemic orthographies 
worked better in dealing with one tone process, and Lexical and Deep orthogra- 
phies worked better in dealing with the other tone process. So the Lexical orthog- 
raphy fared well in both processes, while the others did worse in one orthography 
or the other. Specifically, those writing the Lexical orthography: 
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Table 6: Low-spread and Kabiye orthographies 


Speech Deep Lexical and phonemic Gloss 


orthography  orthographies 
[wélési-@] (wélési) (wélési) ‘listen!’ 
listen-IMP 
[e-welesí-na] (ewélésína) ^ (ewelesína) ‘he listened’ 
3SG-NC1-listen-CoM 
[te-welesí-na] (tewélésína) <tewelesina) ‘didn’t listen’ 


NEG-listen-COM 


Table 7: HLH plateauing and Kabiye orthographies 


Speech Deep and lexical Phonemic Gloss 
orthographies orthography 


[sé-tes] <sétu> <sétu> ‘thanks’ 
thanks-Ncg 

[f£y1] Ri Ri ‘there is not’ 
there is no 

[s£'tó f£y1] (sétu féyt) <sé’tv FÉYL) ‘don’t mention it!’ 
thanks-Ncg 


there is no 


Table 8: Expected results from three experimental orthographies 


Orthography Lexical L tone spreading Post-lexical HLH 
plateauing 

Phonemic Written as pronounced (easier) Written as pronounced 
(harder) 

Lexical Written without 
post-lexical processes 
(easier) 

Deep Written morphographically 

(harder) 


116 


6 Tone, orthographies, and phonological depth in African languages 


scored fewer errors writing an appropriate accent on a vowel than those 
writing the Deep orthography; 


scored fewer errors writing post-lexical non-automatic downstep than 
those writing the Phonemic orthography; 


experienced less degradation of performance on a later test than those writ- 
ing the Deep and Phonemic orthographies; 


were more absorbed with the task of writing accents correctly than those 
writing the Deep and Phonemic orthographies (though this often caused 
them to write long vowels incorrectly). 


One caveat for the experiment is that there is not universal acceptance by 
researchers what the underlying (deep) tones of Kabiye actually are. Also, as 
mentioned before, this experiment focuses on lexical tone, exhaustively marked. 
Other less exhaustive methods of tone marking were not explored. 


4.2 Language typology as a useful guide 


Besides the largely theoretical insights of the Lexical Orthography Hypothesis, 
another promising tool for deciding how to mark tone is a more typological one. 
Kutsch Lojenga (2014) proposes two main types of tone languages. In her termi- 
nology, these are “stable tone languages” and “movable tone languages.” 

Stable tone languages are those in which tone rules do not change an underly- 
ing tone. They tend to have a cluster of properties: 


These languages tend to have shorter words, and more tone levels. 


Tone generally has a heavy functional load, both lexically and grammati- 


cally. 
Grammatical tone can be looked at as tone replacement. 
Writing tone on every syllable is possible and straightforward. 


Teaching phonetic tone awareness is (relatively) easier, and a constant 
word image can be maintained. 


Ndrulo and Attié, cited earlier, are examples of stable tone languages. 

Movable tone languages are those in which the tones change according to the 
context, due to a variety of tone sandhi rules. These also tend to have a cluster 
of properties which differ from the stable tone languages: 


117 


Michael Cahill 


These languages tend to have longer words and fewer tone levels. 


They generally have a lighter load for lexical tone, but often a heavy func- 
tional load for grammatical tone. 


Thus it may be less important to mark lexical tone, but it is important that 
grammatical tone distinctions be differentiated. 


Teaching tone awareness could focus on grammatical notions rather than 
phonetics 


Sabaot, Lugungu, and many Bantu languages are examples of movable tone 
languages. 

Of course, these language types are prototypical. Many languages do not fall 
neatly into these categories. However, this can serve as a general first approxima- 
tion and guide to the type of orthographic tone marking that may prove fruitful. 


5 Conclusions and recommendations 


I conclude this paper with several recommendations - some definite and others 
more tentative — and an open question on “selective tone marking,” 


5.1 Recommendations 


Some practices in orthography development have been confirmed enough by 
experienced people that I can definitely recommend these. 


1. First, work with the community! The emphasis in this paper has been on us- 
ability of the orthography, based on linguistic factors. However, if for any 
reason, the language community does not want to use a particular orthog- 
raphy, linguistic perfection becomes irrelevant. Various sociopolitical factors 
that can be relevant in different situations are discussed in Cahill (2014). 


2. All decisions on marking tone need to be tested. Unforeseen factors, including 
incomplete analysis, may result in one’s orthography not being as useable as 
anticipated. Whether the testing be formal or informal, one needs to check it 
with people who use the language (see Karan 2014 for details). 


3. If itis decided to mark lexical tone in the orthography, mark the output of the 
lexical level, as discussed in §4.1 
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4. When marking grammatical tone of whatever sort, prioritize marking the 
meaning, not the phonetics (in Roberts et al.’s 2013 term, “semiographically”). 
Readers and writers have meaning “in their heads” more than they do the 
abstract sound. Also, a particular grammatical meaning such as “recent past” 
may have several phonetic implementations. Figuring these out is a challeng- 
ing task, but one which, as far as orthography goes, is unnecessary. 


5. Consider how to teach the orthography. Even if speakers know their language 
is tonal, they often do not have a high awareness of the specifics of tone, let 
alone how to represent this. Each tone mark should be taught in a separate 
lesson, just as any consonant or vowel. Also, lexical and grammatical tone 
should be taught separately. 


. a tone orthography needs to be accompanied by a well thought- 
through methodology for awareness raising of tonal contrasts and for 
teaching people to read with the symbols chosen to mark tone in a lan- 
guage. (Kutsch Lojenga 2014: 52) 


6. Make the orthography compatible with electronic devices - phones, tablets, 
internet, and computers in general. A Unicode-compatible orthography* will 
be very helpful in the long run. Non-alphabetic symbols (e.g., * = +) are ap- 
pealing for marking grammatical tone, but a warning here is appropriate. The 
advantages of these marks is that they are already present on the keyboard, 
they can be written in line with the other characters rather than going back to 
add a diacritic, and they can mark an easily recognized meaning rather than 
the harder to process phonetics. However, the Unicode characteristics of these 
symbols mean that many programs will not treat them as part of the word, but 
will split them off from the usual consonants and vowels. Publishing can po- 
tentially be hindered if this issue is neglected. 


7. Finally, consider the writer as well as the reader. Active literacy in a language 
involves simplicity of writing as well as reading. 


The following are additional factors to consider as possibilities in orthography 
design, though I do not suggest them as firmly as the above recommendations. 
These seem reasonable, but have not been proven through practical experience 
to the extent that the definite recommendations above were. 


^Unicode is the international standard for encoding text in electronic data. Major software as- 
sumes user input uses Unicode characters and not a custom font. "Unicode-compatible" in our 
context means first, that only Unicode characters are used, and second, that they are used in ac- 
cordance with their defined set of properties. One of those properties is whether it is treated as 
a “word-forming” character. The usual equals sign (=, Unicode U+003D) is not word-forming, 
but a shortened equals sign (=, Unicode U+A78A) has been defined as word-forming. 
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When extra symbols are needed, consider writing them in line with other let- 
ters, rather than accents above the letter (e.g., “baba, not baba). These are easier 
to write, since the pencil or pen does not have to be lifted to a separate tier 
(think of writing an English word like constitution, which requires dotting (ips 
and crossing <t)s.) More testing and experience is needed, but this may also be 
possibly easier to read. 


Once the initial strangeness of such symbols in the orthography [Bokyi, see 
Table 4] has been overcome, and their function is understood, teams learn to 
use them quite quickly and can get quite excited about them. But teaching 
phonetic tone-marking using accents is always a struggle here, and very, 
very few ever master it. (Harley, p.c.) 


If both grammatical and lexical tone are to be marked, mark them with dif- 
ferent systems. Testing in Togo (Kabiyé language), Roberts marked lexical tone 
with accents, and grammatical tone with other characters. Roberts comments 
that readers seemed to “feel” the grammar more than the sound system. 


5.2 A closing question 


One convention that has been fairly widely practiced, but also has been opposed 
for theoretical reasons, is “selective tone marking” Selective tone marking ap- 
plies tone marking only to one word of a minimal tone pair, leaving the other 
unmarked. Thus if a language has two words [bóbó] and [bóbó], with different 
meanings, they could be written as (bóbo) and (bobo). Selective tone marking 
thus contrasts with marking tone more extensively or exhaustively. 

Wiesemann (1989: 16) and Longacre (1953: 132-133) assert that selective tone 
marking should be avoided. Wiesemann gives the following reason for rejecting 
selective tone marking: 


It should be mentioned here that a system which marks tone where it is 
minimally different in individual words is not a good system. In such a sys- 
tem, for each individual word one must learn whether it carries a tone mark 
or not. To mark low tones only on words where there is a minimal tone pair 
makes the teaching of tone a matter of memory, rather than a matter of 
rules linked to pronunciation. 


Longacre (1953: 133) adds the point that selective tone marking "presupposes 
that one has already made a list of all the words in the language to see which 


120 


6 Tone, orthographies, and phonological depth in African languages 


ones are minimal pairs. Such a claim is pretentious since most newly written 
languages do not have good dictionaries.” 

Thus two reasons for avoiding selective tone marking are 1) the memory load 
of having to know all the individual words which must be marked and 2) the im- 
probability of the orthography designer knowing all such word pairs (or triplets, 
or more) that need to be marked. 

However, dictionaries that include a large percentage of lexemes in a language 
are easier to produce now than in past years (http://www.rapidwords.net/). Also, 
the preference for rules rather than memorization a) is possibly a relic of Western 
education, with its bias against rote memorization, and b) ignores the fact that 
much of our successful (!) English orthography also depends on memorization 
rather than rules, as the examples in Table 9 show. 


Table 9: Variable English pronunciation of same spellings 


Spelling of (ough) words Phonetics Spelling of (ear? words Phonetics 


cough [af] hear [ix] 
though [o] heard [a] 
through [u] heart [ar] 


English orthography is far from being an ideal model, but if such a widely-used 
orthography can depend so much on memorization, then the argument based on 
memory loses its force. So a better case can probably be made for selective tone 
marking than previous scholars have argued. 
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Prosodic restructuring in Somali 
nominal constructions 


Laura J. Downing 


Morgan Nilsson 
University of Gothenburg, Sweden 


This paper investigates variability in the realization of High tones in some nominal 
constructions of Somali. Previous work on Somali tone suggests that most deter- 
miners and all nominal modifiers should realize a High tone when they combine 
with the noun they modify. However, our study finds that there is considerable 
variability in the realization of High tones on nominal determiners and modifiers, 
with the High tone often not realized. This phenomenon of variability in tone re- 
alization is quite reminiscent of Swedish, which shares with Somali the property 
that tone realization is culminative within some prosodic domain. Recent work on 
Swedish has argued that variable non-realization of High tone is best analyzed as 
prosodic restructuring: reducing the number of culminative tonal domains in a con- 
struction necessarily leads to a reduction in the number of surface High tones. We 
argue that prosodic domain restructuring also provides the most plausible analysis 
of High tone reduction in Somali nominals. 


1 Introduction 


It is uncontroversial that the Somali tonal system has canonical stress-like prop- 
erties, as defined in Downing (2010) and Hyman (2006; 2011; 2012; 2014). Work 
by Hyman (1981; 2006; 2012); Le Gac (2003); Green & Morrison (2016), and Saeed 
(2004) agrees that High tone is culminative: no more than one High tone can 
occur per (minimal Phonological) Word (PWord). The position of High tones is, 
roughly, demarcative: they occur on either the penult or final mora of a PWord. 
Only some proper names have a High tone in another position (Saeed 1999: 22). 
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How to account for these generalizations, which hold for PWords in isolation, is 
more controversial. Some possibilities are: underlying accent (Banti 1988; Green 
& Morrison 2016; Le Gac 2003), (underlying) High tone (Andrzejewski 1964; 1979; 
1981; Armstrong 1934; Hyman 2006; 2014; Le Gac 2016), or no underlying tone or 
accent, rather surface tone is the result of morphological tone/accent assignment 
principles (Hyman 1981; Mous 2009). Addressing this problem in detail is outside 
the scope of this particular paper. However, we do assume that Somali is a tonal 
language, not an (underlyingly) accentual one. 

The goals of our investigation are twofold: to document the realization of High 
tones in some nominal constructions, and to account for the position and number 
of High tones that occur within these constructions in terms of matches and mis- 
matches between morphosyntactic structure and prosodic structure.! The central 
empirical finding of the paper is presented in §2, which shows that a number of 
Somali nominal constructions in our corpus do not have the tone pattern ex- 
pected from the previous literature because the expected High tone on the deter- 
miner or modifier is “missing”. In §3, we argue that familiar tone or intonation 
processes like the Obligatory Contour Principle (OCP) or Final Lowering do not 
plausibly account for the missing High tones. In §4, we propose that prosodic 
restructuring provides a better account: a reduction in the number of prosodic 
domains a construction is parsed into leads to a reduction in the number of High 
tones that can be realized in the construction when High tone is a culminative 
property of the domain. We draw a parallel between the prosodic restructuring 
found in Somali and the prosodic adjunction processes that have been proposed 
by Myrberg & Riad (2015) and Riad (2016) for Swedish, a language with a surpris- 
ing number of prosodic properties in common with Somali. 


2 Data to be accounted for 


This paper presents preliminary results of a study of the prosody of some nominal 
constructions of Somali, based on recently collected elicitation data. We begin by 
summarizing the sources of High tones expected in nominal constructions for the 
data we investigated. Then we present the tone patterns attested in our data and 
give more information about our corpus. 


'We adopt the approach to defining mismatches between morphosyntactic and prosodic struc- 
ture developed in work like: Downing (1999; 2016); Inkelas (1993; 2014); Itó & Mester (2012; 
2013); Nespor & Vogel (1986); Riad (2012); Selkirk (1986; 2011); Vigário (2010), and Vogel (2010). 
See Green & Morrison (2016) for a recent alternative analysis of Somali prosody within this 
general framework. 
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2.1 Expected High tones in (non-subject) nominal constructions from 
the literature 


Hyman (1981); Saeed (1993; 1999) and Green & Morrison (2016) show that all So- 
mali nouns in isolation have a High tone on either the penult or the final mora. 
That is, High tone is obligatory in certain morphosyntactic constructions. The 
position is determined by morphological factors (e.g., declension class; “gender”; 
singular vs. plural), not phonological factors. We give a few examples in (1); note 
that compounds have a single High tone, on either the penult or the final mora: 


(1) Somali nominals 


a. tonal minimal pairs 


inan ‘boy’ vs. inan ‘girl’ 
béer ‘liver’ vs. beér ‘garden’ 
^ € 5 ^ € > 
éy ‘dog vs. ey ‘dogs 


b. tone on penult vs. ultima in phonologically analogous words 


dukaan ‘shop’ 
caleén ‘leaf 
sonkór ‘sugar’ 
kibis ‘bread’ 
subag ‘butter’ 
mindi ‘knife’ 
guri ‘house’ 
c. compounds 
dayaxgacméed ‘satellite’ (cf. dayax ‘moon’; gacmeéd ‘of hands’) 
lacaghaye ‘cashier’ (cf. lacág ‘money’; hay- ‘have, hold’; -e agentive) 
caanagéel ‘camel milk’ (cf. caanó ‘milk’; géel ‘camels’) 
madaxweyne ‘president’ (cf. mádax ‘head’; wéyn ‘big’) 


Nouns can be followed by a number of determiners. As shown in the list in 
(2), while the definite determiner is toneless, the other determiners introduce a 
High tone: 


?The k/g/h/O vs. t/d/sh alternations in the Somali determiner system illustrated in our data 
is conditioned by gender agreement: masculine nouns take the k/g/h/@ series of determiners, 
while feminine nouns take the t/d/sh series. The allomorphy found in both series of determiners 
is phonologically conditioned. See, e.g., Saeed (1993; 1999), for detailed discussion of this sandhi 
phenomenon. 
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(2) Somali determiner types (Saeed 1999: 111-117) 

a. Definite -ka/-ta 

b. Remote definite -kii/-tii 

c. Interrogative -keé/-teé ‘which’ 
Possessives -káyga/-táyda ‘my’, -káaga/-táada ‘your (sg.)’, -kiisa/-tiisa 
‘his’, -kéeda/-téeda ‘her’, -kayága/-tayáda ‘our (excl.)’, -kéenna/-téenna 
‘our (incl.)’, -kiinna/-tiinna ‘your (pl.)’, -kóoda/-tóoda ‘their’ 

e. Demonstrative -kán/-tán ‘this’, -kaás/-taás ‘that’ 

According to work like Green & Morrison (2016); Hyman (1981), and Saeed 
(1993; 1999), when the High-toned determiners occur in combination with a noun, 
they retain their High tone, as illustrated in (3). While Hyman (1981: 191) mentions 
a process of accent (High tone) reduction on possessive determiners following a 


noun, only an example or two is provided. None of the determiners change the 
tone of the base noun, except -keé/-teé ‘which?’ (3h, 3i). 


(3) Somali nouns with determiners (Saeed 1993: 160—168) 
a. nín man | nín-ka ‘the man’ 
b. naág ‘woman’ | naág-ta ‘the woman’ 
c. nín'man' | nín-kán ‘this man’ 
d. naág ‘woman’ | naág-tán ‘this woman’ 
sáddex ‘three’ | sáddex-daás ‘those three’ 
shuqul ‘work’ | sháqul-káyga ‘my work’ 


lacag ‘money’ | lacág-táada ‘your money’ 


DO mo 


nin ‘man’ | nin-keé ‘which man?’ 


EH 


Z D H 4 € H 2 
naág ‘woman’ | naag-teé ‘which woman? 


Green & Morrison (2016); Hyman (1981), and Saeed (1993) observe that the 
modifier in a Noun+modifier phrase - Noun+Noun or Noun- Adjective - is also 
expected to be realized with a High tone when the phrases are pronounced in 
isolation.? This is illustrated in (4), where we see that a High tone is assigned 
to the final vowel of the (indefinite) postnominal modifier, while the modified 


3Noun+Noun (N+N) modifier phrases are called “genitive constructions" by Hyman (1981) and 
Saeed (1993). Green & Morrison (2016) refer to N+N modifier phrases as “associative construc- 
tions" and, following Saeed (1993), use the term “attributive adjective" for the Noun+adjective 
construction. See Saeed (1993) for detailed discussion of Noun+modifier constructions in 
Somali. 
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noun keeps its base High tone pattern.* Note that numbers are considered nouns 
(Saeed 1993: 123) and can head Noun+modifier phrases, as shown in (4b, 4c). 


(4) Somali Noun+modifier phrases (Hyman 1981; Green & Morrison 2016; our 
elicitation notes) 


a. géed wiil ‘a tree of a boy’ (cf. wiil ‘boy’) 

b. afar buüg ‘four books’ (cf. báug book) 

labó sabuurad-oód ‘two blackboards’ (cf. sabuurád “blackboard’) 
gacán-ta midig ‘the right hand’ (cf. mídig ‘right side’) 

mindí-da Maxaméd ‘the knife of Maxamed’ (cf. Maxámed) 


f. gaarí cusüb ‘a new car’ 


> p 


m 


g. shúqul adág ‘hard work’ 


In sum, many nominal constructions are expected to have two High tones: one 
on the noun and one on the following determiner or modifier (noun or adjective). 

In keeping with the one High tone per Prosodic Word (PWord) principle, Green 
& Morrison (2016) propose that Noun+determiner (H-toned) and Noun+modifier 
constructions have the same prosodic representation: namely, they are both 
parsed as independent PWords from the noun they modify. The representations 
in (5) adapt Green & Morrison’s (2016) analysis by abstracting away from the 
PWord-min vs. PWord-max distinction they argue for; surface High-toned mor- 
phemes are bolded: 


(5) Prosodic structures for Somali nominals; parentheses indicate PWords 
(adapting Green & Morrison 2016) 


a. N+definite (N)pworg def 

b. N+H-toned determiner (N)pwora (Det)pword 
c. compound (ON) (N))pword 

d. N+modifier (N)pwora (Modif) pword 


Notice the parallelism in the structure of N+determiner (5b) and N+modifier 
(5d). These representations form the starting point for our investigation. 


“Only a few adjectives, e.g., dhéer ‘long’, wéyn ‘big’ (Saeed 1999: 105-106), and some fe- 
male proper names seem to constitute exceptions to the generalization that modifiers in 
Noun+modifier phrases have a High tone on the final mora. 
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2.2 Our data 


The new data discussed in this paper were collected in 2016 through elicitation at 
the University of Gothenburg, working mainly with one speaker from Kismayo 
and two speakers from Mogadishu. The overall corpus comprises 10,002 individ- 
ual utterances (tokens) representing 2,970 different types. The NPs were provided 
both in isolation and in short sentences, in morphosyntactic contexts where the 
variable High tone is expected to consistently occur. 

What we find in these data is that the nominal constructions in (5b) and (5d) 
do not consistently have the High tone patterns expected from previous work on 
Somali prosody. Instead, the High tone on the determiner or nominal modifier is 
often “missing”. For example, the possessive determiner is seldom realized with 
its expected High tone. Out of a total of 1,068 instances of N+possessive construc- 
tions in our corpus,” 929 (87%) do not realize a High tone on the possessive suffix. 
This is illustrated in (6). 


(6) a. biyá-hayga ‘my water’ (~ biyá-háyga) 
b. dhég-tiisa ‘his ear’ (~ dhég-tiisa) 
c. mindi-diisa ‘his knife’ (~ mindi-diisa) 
d webi-gooda ‘their river’ (~ webí-góoda) 


e. biyó-hooda ‘their water’ (~ biyó-hóoda) 
f. bisád-deeda ‘her cat’ (~ bisád-déeda) 


The pitch pattern of a typical possessive with the High tone missing, like (6f), 
is shown in Figure 1. 


220 Hz 


Ke bk E eu Da 


50 Hz 


Figure 1: bisaddeeda ‘her cat’ 


Our data thus tends to confirm Hyman’s (1981) observation that the posses- 
sive undergoes accent (High tone) reduction, except that High tone reduction is 
variable in our corpus: it does not occur 100% of the time. 


"This set of 1,068 excludes constructions containing the shorter (indefinite) possessive suffixes, 
in which tonal differences reflect the distinction between a non-focused subject and other 
syntactic functions. 
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Interestingly, our data shows that the likelihood of realization of the deter- 
miner’s High tone is construction specific. For example, the demonstrative suf- 
fixes also often lack a High tone: out of a total of 635 instances of Noun+demon- 
strative constructions in our corpus, 369 (58%) do not realize a High tone on the 
demonstrative suffix. This variation is illustrated in (7): 


(7 a. wiil-kan ‘this boy’ (-wíil-kán) 
b. gaari-gaas ‘that car’ (-gaarí-gaás) 
c. Talaadá-daas ‘that Tuesday’ (~ Talaadá-daás) 


This is a significantly lower proportion of missing High tones, however, than 
found with the possessive suffixes. 

In contrast, the remote definite suffix more generally realizes its High tone. 
Out of 997 instances of Noun+remote definite constructions in our corpus,° only 
358 (36%) do not realize a High tone on the remote definite suffix. This variation 
is illustrated in (8): 


(8) a. mindi-dii that (remote) knife’ (~ mindi-dii) 
b. nin-kii ‘that (remote) man’ (~ nin-kii) 


c. qoraallá-dii ‘those (remote) texts’ (~ goraalla-dii) 


Finally, Noun+modifier constructions should have a High tone assigned to the 
final syllable of the second word.’ Yet, in our data, this High tone is again of- 
ten missing. Out of 571 instances of Noun+indefinite Noun constructions in our 
corpus,® 391 (68.5%) do not realize a High tone on the second noun. This varia- 
tion is illustrated in (9a, 9b); recall that numbers are nouns in Somali. Out of 599 
instances of Noun+Adjective constructions, in contrast, only 297 (50%) do not 
realize a High tone on the modifier. This variation is illustrated in (9c, 9d):? 


*This set of 997 excludes constructions where the remote definite suffix occurs in a non-focused 
subject NP. 

"Nouns can be followed by more than one modifier, and it is the final modifier which should 
realize a final High tone. We consider here only nouns followed by a single modifier, for ease 
of comparison with the Noun+determiner data. 

*This number excludes feminine nouns - mostly personal names and place names - with a 
fixed (exceptional) accent on a non-final mora, which show much less variability in their tonal 
behavior. 

?This set of 599 excludes constructions where the adjective either has the subject suffix -i or 
has an exceptional, fixed penult accent (in our data: wéyn ‘big’, dhéer ‘long’, macáan ‘sweet’), 
which show almost no variability in their tonal behavior. 
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(9) a. hal litir ‘one liter’ (~ hál litir) 
b. guri-ga Muuse ‘Musa’s house’ (~ guri-ga Muusé) 


c. biyó badan ‘much water’ (~ biyó badán) 


d. subáx-dií hore ‘(in) the early morning’ (~ subáx-dií horé) 


To sum up this section, High tones on determiners and nominal modifiers are 
often not realized. All the speakers exhibit the same range of variation, so it 
cannot be attributed to dialectal or individual differences. In the next section, we 
take up two untenable phonological accounts - the OCP and Final Lowering - 
for why these High tones are missing before arguing for an alternative analysis 
in §4. 


3 Why the OCP and Final Lowering cannot account for 
the missing High tones 


3.1 The OCP 


Looking at Figure 1, above, one might propose that the High tone on the posses- 
sive suffix - and other determiners - is deleted as an OCP effect (Leben 1973): ina 
sequence of adjacent High tones, one is deleted. However, this explanation faces 
the problem that the OCP is not a general principle of the Somali tone system. 
When two consecutive High tones occur, they are normally realized on almost 
the same pitch level. This is illustrated in the pitch track for (9d), subaxdii hore 
‘(in) the early morning’ given in Figure 2. 


170 Hz 


50 Hz 


su- bax- dií ho- re 
Figure 2: subáxdií hore '(in) the early morning’ 
Therefore, tonal reduction on possessives and other determiners is not plau- 
sibly motivated by the OCP. Furthermore, the OCP is not relevant in the case 


of the missing High tones on the final vowels of modifiers in Noun«modifier 
constructions, such as hore ‘early’ in this example. 
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3.2 Final Lowering 


The Somali tone literature (Andrzejewski 1981; Armstrong 1934; Hyman 1981; Le 
Gac 2003; Saeed 1993; 1999) notes that High tones are lowered phrase-finally/ 
pre-pausally/post-focally. Since the “missing” High tones in our data - in both 
Noun+determiner and Noun+modifier constructions — often occur in phrase- 
final position, one might propose that Final Lowering is responsible for giving 
the impression that the High tones have been deleted. However, Final Lowering 
does not provide a general explanation for the missing High tones in our data. 
First, the expected High tone on the possessive suffix (e.g., dhég-tíisa ‘his ear’) is 
not associated with the final syllable or mora, so Final Lowering is not relevant 
here. Second, final High tones are not deleted in our data in other morphosyntac- 
tic contexts, such as lexical words in isolation (10) and in sentence final position 
(11), where the final High tone is expected to be realized. Non-lowered final High- 
toned vowels are bolded: 


(10) Words in isolation 
a. tukayadl ‘crows’ 
b. ubaxyó ‘flowers’ 


c. lafdhabarró 'spines' 


(11) Sentence final position 
a. Wáxaan arkay ratí. ‘I saw a camel: 
b. Waxaan lá kulmay Sahró. ‘I met with Sahra. 
c. Hooyadáa waa macallimád. ‘Your mother is a teacher. 


d. Libáax ayaa dilay dawacó. ‘A lion killed a fox: 


That is, final High tones are not systematically deleted. Only High tones on 
(some) postnominal determiners and modifiers are. 

Third, High tones can be deleted from the word-final syllable of a determiner 
or a modifier even when the word is not phrase-final or pre-pausal: 


(12) Vowels with “missing” non-final High tones are bolded 

a. labá dámar ah oo qurüx badan oo kaliya 
‘just two beautiful women’ (cf. labá ‘two’; dámar ‘women’; qurüx 
‘beauty’; badán ‘much’; kaliya ‘only’) 

b. sánnad-kii hore ~ sánnad-kií hore 
‘last year’ (cf. sánnad ‘year’; horé ‘previous’) 

c. bisád-daas yar ~ bisád-daás yar 
‘that little cat’ (cf. bisád ‘cat’; yar ‘small’) 
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To sum up this section, neither the OCP nor Final Lowering can account for 
the missing High tones in N+determiner and N+modifier constructions, because 
neither of these processes applies generally in Somali. Instead, High tone dele- 
tion/non-realization appears to be construction specific. Further, High tones are 
often deleted when the context for neither the OCP or Final Lowering is met. 


4 Tone “deletion” as prosodic restructuring 


4.1 Parallels with Swedish prosody 


Coming from a Swedish background, one is struck by the similarities between 
the prosodic systems of Somali and Swedish, another language with a stress-like 
tone system. As argued for in recent work by Riad (2012; 2016) and Myrberg & 
Riad (2015), Swedish has prosodic culminativity of stress at the PWord nin) level 
and culminativity of tone at the PWord(max) level; compounds are a single tone 
realization/assignment domain; and some affixes are stressed, while others are 
not. Data illustrating these properties for Swedish are given in (13-15); all of the 
data are cited from Myrberg & Riad (2015). In (13), we see that Swedish does not 
have secondary stress, unlike English or German: 


(13) PWord culminativity for stress 

a. American English and Swedish stress 
i. (mone tary), (monetär),,min=max 
ii. (totalitarian), (totalitär),,min=max 
iii. (ab stract),, (ab'strakt)„min=max 

b. German and Swedish stress 
i. (militarisieren),, (militarisera),,min=max 
ii. (ono matopoetisch),, (onomatopoetisk),,min=max 


iii. (universität), (universitet),,min=max 


The data in (14) shows that compounds have two stresses but a single tonal 
accent (accent 2, indicated with a diacritic superscripted before the compound) 
realized over the entire compound and replacing the isolation tone pattern of the 
words making up the compound: 


(14) Compounds 


a. sommar-lov ?((sommar;),min( lov;),,min),,max 
'summer break' 
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b. jul-lovs-morgon ?((‘jul;),,min(lov,-s),,min(morgon,),,min),,max 
‘Christmas break morning’ 

c. jul-klapp 2((jul,),,min(klapp,),,min),,max 
‘Chistmas present’ 


The data in (15) illustrates that some affixes bear a stress and takes part in the 
realization of a tonal accent 2 in the construction, while others — (15c) - bear 
neither stress nor tone: 


(15) Affixes 
a. tvatt-bar "((tvátt),,min( bar), ,min),,max ‘washable’ tonic suffix 
b. o-nódig 2((‘o),,min( nöd-ig) min) max ‘unnecessary’ tonic prefix 
c. fór-àndra (för-! (ändra) min) max ‘to change’ 
All of these properties also characterize Somali's prosodic system, as we have 


seen in the preceding sections. Table 1 summarizes the similarities in the prosodic 
systems of the two languages. 


Table 1: Comparison of the prosody of Swedish and Somali 


Somali Swedish 


PWord culminativity v^ v 
compounds are 1 tone realization domain v v 
stressable/tone bearing affixes v v 


^tone 
"stress (PWord,,i,); tonal accent (PWord,,,.) 


4.2 Prosodic restructuring in Swedish 


Another property the Swedish and Somali prosodic systems have in common is 
that a tone (accent) is obligatory. In Swedish, tonal accent is obligatory for all 
PWord(max), and in Somali, for all nominals and modifiers when pronounced in 
isolation. Yet, in both languages, an expected High tone (or accent) is sometimes 
“missing”. We illustrated the “missing” High tones of Somali in 83. For Swedish, 
as work like Garlén (1988); Myrberg & Riad (2015) and Riad (2016) reports, words 
in some short phrases, which sometimes are even structurally similar to those in 
our Somali data, variably lose their tonal accent. This phenomenon is illustrated 
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in (16). As we can see in this data, the input tone-accent is sometimes simpli- 
fied. Accent 2 can be reduced to accent 1 - as in (16a). The examples in (16b- 
16c) illustrate that often only the accent of the rightmost PWord in the phrase 
is consistently realized. (In this data set, subscripted numbers indicate the input 
tonal accents. Superscripted numbers indicate the tonal accents realized under 
prosodic adjunction): 


(16) Prosodic restructuring in Swedish (Myrberg & Riad 2015) 


a. Prosodic adjunction in morphology and syntax 
morphology: (fór-! (ändras) p) max ‘to change’ 
syntax: (for 1(liteng),,),,max ‘too small’ 
(for 1(‘mangay),,),,max ‘too many’ 
(for ! (lànge;),,), max ‘too long’ 
b. Lexicalized phrases with prosodic adjunction 
((ràd-a5),, ?(matt-an;),,),,max ‘red carpet’ (lexicalised phrase) 
((Rôd-a),, !(Kors-et1)o) max ‘Red Cross’ (name, lexicalised) 
((hopp-a2)., !(upp,),.),max ‘jump up’ (particle verb) 
((hel-a5),, (làng-a2),, ‘(dag-en,),),,max ‘all day, lit. whole long day’ 
c. Local deaccentuation as the result of prosodic adjunction 
((liten;),, ?(smuts-ig;),;),,max ?(gryt-a2),,min-max ‘little dirty pot’ 
((lag-a2),, (be-! (gagn-ade;),,),,) max (kopial'torer;),,min=max 
‘repair used copying machines’ 


To account for this variation in output accent realization, Myrberg & Riad 
(2015) and Riad (2016) propose that both words and affixes can be incorporated 
into a single PWord(max) via prosodic restructuring (adjunction). (Their restruc- 
turing analysis is illustrated in the prosodic representations in (16).) Prosodic 
restructuring accounts for the tonal reduction found, given the culminative one 
tonal accent per PWord(max) principle which Riad (2016) shows holds for Swedish. 
Reducing the number of PWord (max) in a phrase necessarily reduces the number 
of tonal accents that can be realized. 


4.3 Prosodic restructuring in Somali 


What we propose is that Somali High tone reduction is the result of prosodic 
restructuring, analogous to what has been proposed for Swedish by Myrberg 
& Riad (2015) and Riad (2016) and illustrated in the preceding section. We also 
adopt from their analysis a distinction between PWord(min) and PWord max)” 


Following Myrberg & Riad (2015) and Riad (2016), we adopt a distinction between PWord-min 
and PWord-max as defined in work like Itó & Mester (2012; 2013). 
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We follow work like Green & Morrison (2016); Hyman (1981; 2006; 2012); Le Gac 
(2003) and Saeed (2004) in assuming that a culminative one High tone per PWord 
principle holds for Somali. For High-toned determiners like the possessive to be 
realized with a High tone, we propose that they must therefore be parsed in a sep- 
arate PWord{min) from the noun they modify. However, High-toned determiners 
are arguably parsed in the same PWord(max) domain as a preceding noun, because 
they undergo segmental sandhi processes which do not apply across PWord (max) 
boundaries. The representations in (17) illustrate our analysis; 


(17) 


a. ((dhég)pword-min (tiisa)pWord-min)PWord-max ‘his ear’ 
b. ((mindí)pword-min (díisa)pword-min)PWord-max ‘his knife’ 


Note that the alternation between -diisa and -tiisa in (17) is an example of a 
sandhi process that only applies across a PWord-min boundary within PWord- 
max in our analysis. (See Saeed 1999: 28-31 for detailed discussion of these seg- 
mental sandhi processes.) 

In the variable pronunciation where the possessive, for example, has lost its 
High tone, we propose that the construction has the same recursive PWord struc- 
ture as the toneless definite determiner suffix - cf. (5a). That is, the construction 
has undergone prosodic restructuring like that found in Swedish (16), so that the 
possessive suffix is not an independent PWord,min), but rather is adjoined to the 
preceding noun within PWord (ax), as shown in (18) - cf. (17a) and (18b): 

(18) 


a. ((dhég)pword-min ta)PWord-max 'the ear 
b. ((dhég)pword-min tiisa)PWord-max ‘his ear’ 


Since High tone is culminative and obligatory within the PWord(min) domain, 
reducing the number of PWord(min) within a PWord(max) domain necessarily 
leads to a reduction in the number of High tones realized within PWord (ax). 

We propose that Noun+modifier phrases that have “lost” the High tone on 
the modifier have a similar analysis. They also undergo prosodic restructuring, 
but they are parsed into a different prosodic domain from the restructured deter- 
miners. Following Vigário (2010) and Vogel (2010), we assume a Complex Word 
Group (CWG) constituent, which is the domain of, for example, tone assignment 
to compounds. Recall from (1c), above, that compounds in Somali form a single 
tonal realization domain; more examples are given in (19): 


"See Green & Morrison (2016) for an alternative account of similar Somali data. Space does 
not permit a careful critical comparison of our analysis with theirs. The interested reader is 
directed to their work for details. 
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(19) a. madax-weyn-e (m.) ‘president’ (cf. mádax (m.) ‘head’; wéyn ‘big’; -e 

agentive suff.) 

b. cod-kác (m.) ‘tonal accent’ (cf. cód (m.) ‘voice’; kac- ‘rise’) 

c. biya-dhác (m.) ‘waterfall’ (cf. biyó (pl.) ‘water’; dhac- ‘fall’) 

d. bad-wéyn (f.) ‘ocean’ (cf. bad (f.) ‘sea’; wéyn be) 

e. magaala-mádax (f.) ‘capital’ (cf. magaálo (f.) ‘town’; mádax (m.) 
‘head’) 

f. laf-dhabar (f.) ‘spine’ (cf. laf (£) ‘bone’; dhábar (m.) back) 


We propose that restructured Noun+modifier phrases are also parsed into a 
CWG, as illustrated in (20) - cf. (5d), above: 


(20) ((hal)pword-max litir)cwe ‘one liter’ 


It is interesting to note that tonal reduction results in a High tone on the left- 
most word of the Noun+modifier construction, whereas compounding typically 
results in a single High tone on the rightmost word. We conclude that these two 
constructions must have a different prosodic representation. As illustrated in (21), 
prosodic restructuring of a Noun+modifier leads to a left-headed construction, 
whereas compounds are right headed:!* 


(21) prosodic restructuring: ((hal)pword-max litir)cwo ‘one liter’ 
vs. 
compound: (cod (kac)pword-max)Cwc tonal accent’ 


Both CWG constructions are posited to contain a PWord-max in order to ac- 
count for the fact that segmental sandhi processes do not occur across the words 
in the CWG domain: 


(22) No segmental sandhi in the CWG domain 
prosodic restructuring: 
((laba)pword-max tuug)cwc (“laba duug) ‘two thieves’ 
vs. compound: 
(keli (talis)pword-max)cwc Č keli dalis) ‘dictatorship’ 
(cf. kéli (m.) ‘being alone’; talis ‘government’) 


ZA reviewer notes that another difference between restructured N+modifier phrases and com- 
pounds is that the tonal accent (and prosodic structure) assigned to compounds never varies. 
We propose that the lack of variation in compounds is due to the fact that they are lexicalized 
forms, morphosyntactically non-compositional. High tone variation in our data is found in 
compositional forms. 
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To account for tone assignment in these constructions, we propose that, in 
nominal CWGs, the High tone assigned to a CWG is realized on the internal 
PWord(max). This is again analogous to Swedish, which only allows one tonal 
accent to be assigned both to a compound or compound-like construction, as 
well as to a construction that has undergone prosodic restructuring. The accent 
assignment principles for compounds and restructured words and phrases are 
not identical in Swedish. Note that this also generally holds true for Somali. 

One last point in favor of our analysis is that it accounts for why determiners 
are added only to the rightmost word in a compound, whereas each noun in a 
Noun+Noun construction can have its own determiner, as shown in (23). 


(23) compound Noun+Noun 
a. guri-márti-ga ‘the guesthouse’  güri-ga marti-da ‘the guests’ house’ 
b. caana-géel-a ‘the camel milk’  caanä-ha géel-a ‘the camels’ milk’ 


Recall that compounds and Noun+Noun constructions are also potentially dis- 
tinguished by tone, as shown in (23). Noun+Noun constructions can have two 
High tones, one for each PWord, when they are not restructured, while com- 
pounds, which contain only one PWord, only have one High tone: 


(24) compound Noun+Noun 
a. (guri(marti)pword-max)CWG b. (gári)PWord-max 
‘guesthouse’ (martí)pword-max ^ 

((gür i)PWord-max mar ti)cwc 
‘guests’ house’ (cf. guri 
‘house’; marti ‘guests’) 
c. (caana(géel)pword-max)CWG d. (caanóÓ)pword-max 

‘camel milk’ (geél)pword-max ~ 
((caanó)pword-max geel)cwa 
‘camels’ milk’ (cf. caanó 
‘milk’; géel ‘camels’) 


We propose that a determiner takes a PWord as its base. Since compounds 
contain only one PWord, as shown in (24a,c), they can take only one deter- 
miner. However, each noun in a Noun+Noun construction can be parsed as a 
PWord, as shown in (24b, d), and so each can take a determiner. Strikingly, Noun1- 
determiner+Noun2-determiner constructions cannot be restructured; the High 
tone of the second noun is always realized. The determiner, which requires a 
PWord base, appears to block tonal reduction in Noun2. In our analysis, this falls 
out from proposing that both the determiner and the High tone require a PWord 
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base. When Noun2 is not parsed as a PWord, under restructuring (see 25d), it can 
bear neither a determiner nor a High tone. 

Our analysis is summarized schematically below (cf. 5, above); ‘X’ in com- 
pounds indicates that compounds can be made up of a combination of lexical 
words (X): 


(25) Summary of our analysis 


a. N+definite: ((N)pword-min def)pwora-max 
b. N+H-toned determiner: ((N)pword-min (Det)pword-min)PWord-max 
OR restructured ((N)pword-min Det)pword-max 


c. compound: (X (X)pword-max)CWG 
d. N+modifier: (N) PWord-max (Modif)pword-max 
OR restructured ((N)pword-max Modif)cwo 


5 Conclusion and topics for future research 


Many nominal constructions in Somali do not realize the expected High tones on 
the determiner or modifier; that is, they undergo tonal reduction. We argue that 
tonal reduction in Somali is the consequence of prosodic restructuring, rather 
than as the result of tone deletion or tonal lowering processes. Since High tone 
is culminative within the PWord(min) domain, prosodic restructuring which re- 
duces the number of PWord (nin) automatically reduces the number of High tones 
which can be realized in the domain. It is striking that prosodic restructuring in 
Somali so closely parallels prosodic restructuring in Swedish. This leads us to 
propose that restructuring is typical of these kinds of tonal systems: i.e., with 
tonal culminativity within a prosodic domain. An important question for future 
research is whether Somali High tones are underlying or assigned to PWord nin) 
(depending on morphosyntactic information, à la Hyman 1981). While our analy- 
sis shows that High tone realization is conditioned by prosodic domain structure, 
the source of the High tones in the representation requires further investigation. 
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Chapter 8 


DP-internal structure and agreement in 
Nafara 


Bertille Baron 


Georgetown University 


Senufo Nafara DPs show the particularly rare unmarked word order [N AP Def 
Dem Numeral]. In this cartographic account, the proposed derivation uses spec-to- 
spec movement operations to generate this word order (Cinque 2005). This analysis 
relies on two main claims: there is a domain XP under Num (Aboh 2004) in which 
two distinct categories of adjectives (high and low following Cinque 1994) merge to 
modify the noun. It is this same XP domain that undergoes spec-to-spec movement, 
motivated for agreement purposes. Oppositely, all elements structurally above XP 
remain in situ. 


1 Introduction 


Nafara is a Senufo language! spoken in Korhogo, Cóte d'Ivoire. Determiner 
phrases in the language show the canonical word order [N Adj D Dem Num], 
like in (1) below. 


(1) lo tá -gəl gal korfi 
mango sweet -DEF.3.PL DEM.3.PL seven 
N Adj -D Dem Num 


‘these seven sweet mangoes’? 


‘Although there is some controversy over its classification (Manessy 1975; Naden 1989), Senufo 
is traditionally defined as a branch of the Niger-Congo, Gur subgroup (Westermann & Bryan 
1970; Bendor-Samuel 1971). 

"The data introduced and analyzed throughout this paper was elicited from a native speaker of 
Senufo Nafara. 


Bongiovanni, Phillip Weirich, Robert Botne & Samuel Gyasi Obeng (eds.), African linguistics 
across the disciplines: Selected papers from the 48th Annual Conference on African Linguistics, 


Bertille Baron. 2019. DP-internal structure and agreement in Nafara. In Samson Lotven, Silvina 
AM 143-158. Berlin: Language Science Press. 


Bertille Baron 


As we see here, attributive adjectives usually directly follow the noun. While 
they do not seem to show agreement, adjectives are apparently the host for the 
number and gender specific determiner.? The demonstrative directly follows the 
determiner, and inflects for number and gender also. Numerals are DP-final in 
the language, and do not show concord.* 

The DP word order in (1) is highly unexpected, as argued by Cinque (2005). 
First, cross-linguistically, Numerals in DP-final position are noticeably rare. 
Then, syntactically speaking, the derivation of this word order is usually con- 
sidered marked (Cinque 2005). 

This paper accounts for both the syntactic structure and derivation of Na- 
fara DPs, assuming the universal functional hierarchy [Dem > Num > Adj > N] 
adopted by Greenberg (1963), Cinque (2005) and others. In §2.1, I first discuss 
the location of each DP-internal element. I also draw distinctions between two 
separate categories of adjectives (that I refer to as high and low, corresponding 
to Cinque’s (2010) indirect and direct modification respectively). In §2.2, I moti- 
vate the movement of adjectives and noun only, responsible for the surface word 
order. Finally, in §2.3, I show that such movement is motivated for agreement 
purposes. 


2 DP structure and word order 


2.1 Assumptions and data 
2.1.1 The functional structure of DPs 


In a cartographic approach to DP structure, Greenberg (1963) and Cinque (2005) 
argue that the only possible initial order capable of universally generating the 
DP word orders attested in natural language is [Dem > Num > Adj > N]. Assum- 
ing the underlying structure in Figure 1, Cinque accounts for all possible word 
orders as the result of some type of movement of NP. Namely, he argues for com- 
binations of successive Spec-to-Spec movements with or without pied-piping of 
NP up the DP spine. 

Note here that, in Cinque’s account, Num, Dem, and A are heads of their own 
functional projections (henceforth FPs) above NP. Those FPs merge in the Speci- 
fier of phrases (here WP, XP, and YP) that are all sisters to an Agree node, head of 


>The noun classification adopted in this paper is adapted from those provided by both Manessy 
(1996: 22) and Carlson (1990: 76) for other Senufo varieties. Nafara shows a 5-gender system, 
where genders 1, 2, and 3 consist of countable nouns, and genders 4 and 5 are for non-count 
nouns. Considering the scope of this paper, I will limit my discussion to countable nouns. 

“This is true of all numerals except ‘one’ which shows gender agreement, as shown in 82.1.4. 
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Agr, P 
Agr, WP 
DemP 
W  AgrP 
Agr, XP 
Bai 
X  Agr,P 
Agr, YP 


Y NP 


Figure 1: the cartographic representation of DPs (adapted from Cinque 
2005: 2). 
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an AgrP where NP potentially lands when undergoing movement. From a Mini- 
malist perspective (Chomsky 1995), such a structure violates the economy prin- 
ciple by assuming that additional and potentially unnecessary elements merge 
into the syntactic structure. Therefore, and in an effort to preserve Cinque’s hi- 
erarchy, I assume the syntactic structure in (2) below, where functional elements 
all merge directly above NP and below D as heads of their own projections, and 
where AgrPs are eliminated.” Adjectives are not included in (2). Their location 
and behavior will be discussed more thoroughly in §2.1.5. 


(2) [pp D [Demp -— Dem [Nume ... Num [np ... N ]]]] 


The remainder of 82.1 will focus on the distribution and properties character- 
istic of each functional and lexical category represented in Nafara DPs. 


2.1.2 Determiners 


Nafara determiners agree in gender and number with the noun they determine. 
They obligatorily attach to the right edge of the noun in simple DPs, as in (3). 
However, in the presence of AP, they attach to the AP final element, as in (4). It 
is important to note here the variable placement of the adverb in the AP. 


(3) a lo -gal 
mango -DEF.3.PL 
‘the mangoes’ 
b. * gal lo 
DEF.3.PL mango 


‘the mangoes’ 


(4) lo [(del) tà /ta (tel) ]-gel 
mango [(very) sweet / sweet (very) ] -DEF.3.PL 


‘the very sweet mangoes’ 


Cinque (2005) does not account for determiners and how or where they surface 
in the DP. Because they are separate morphemes in Nafara, and because it is the 
only DP-internal element encoding definiteness, I take the definite marker to 
be the realization of the D head, merging as the sister to the highest DP-internal 
functional projection, as in (2) above. That D head is spelled out in situ, cliticizing 
to the final extended noun phrase element located in its specifier, as shown in 
82.2. 


"The present analysis for the surface DP word order in Nafara does not justify the need for 
AgrPs to be generated in the syntax. 
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2.1.3 Demonstratives 


Demonstratives also inflect for number and gender. They necessarily follow the 
determiner, as shown in (5). 


(5) a pom ta -bel bal 
apple sweet -DEF.1.PL DEM.1.PL 


‘these sweet apples’ 
b. *pom_ bal tá -bel 


apple DEM.1.PL sweet -DEF.1.PL 


‘these sweet apples’ 


Following Cinque’s (2005) account, I assume that they merge rather high in 
DP, as the head of a functional projection DemP. 


2.1.4 Numerals 


As shown below, numerals occur in DP-final position. Because the literature on 
numerals has not led to any consensus on their syntax, both cross-linguistically 
and within languages, and due to the fact that I have no further evidence to mo- 
tivate a more elaborate account for the distribution and properties of numerals 
in Nafara, I follow Cinque (2005) in assuming that they merge as the head of a 
functional projection NumP.Ó 


a. pom -u nua 
apple -DEF.1.SG one.1 
‘the one apple’ 

b. fi -g nurgo 
tree -DEF.2.SG one.2 
‘the one tree’ 


c. lo -n nuná 
mango -DEF.3.SG one.3 


‘the one mango’ 


$Cross-linguistically, numerals have either been analyzed as lexical or functional categories, 
showing great cross-linguistic differences (Danon 2012; Ionin & Matushansky 2006, among 
others). 
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(7) ‘seven’ 

a. pom -u korfi 
apple -DEF.1.PL seven 
‘the seven apples’ 

b. fi -i korfi 
tree -DEF.2.PL seven 
‘the seven trees’ 

c. lo -gel korfi 
mango -DEF.3.PL seven 


€ > 
the seven mangoes 


Note here that while ‘seven’ in (7) - as well as any other number in the lan- 
guage — does not inflect for gender, the number ‘one’ in (6) along with all num- 
bers compounded with ‘one’ do. For this reason, I argue that ‘one’ merges in Num 
with an unvalued gender feature [uGEN]. Following both Greenberg (1963) and 
Cinque (2005), I assume NumP to be below DemP in the functional projection 
hierarchy. The DP word order in Nafara is [N A D Dem Num] as in (1), repeated 
in (8) below. 


(8) lo tá -gəl gal korfi 
mango sweet -DEF.3.PL DEM.3.PL seven 
N Adj -D Dem Num 


€ à À 
these seven sweet mangoes 


2.1.5 Adjectives 


So far, I have argued that attributive adjectives directly follow the noun they mod- 
ify, as in (9a) below. The ungrammaticality of (9b) and (9c) shows that attributive 
adjectives that occur in postnominal position (that is to say most adjectives in 
the language) cannot occur in prenominal position. 


(9) a. lo tà -gal 
mango sweet -DEF.3.PL 
‘the sweet mangoes’ 
b. *ta lo -gal 
sweet mango -DEF.3.PL 


‘the sweet mangoes’ 
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* 


c. “tà -gal lo 
sweet -DEF.3.PL mango 


‘the sweet mangoes’ 


There are however certain adjectives that can only occur prenominally. It is 
the case for adjectives of nationality, as shown in (10a). 


(10 a. kodivwarefju -u 
Ivorian man -DEF.3.SG 


‘the Ivorian man’ 


b. *fju -u kodivware 
man -DEF.1.SG Ivorian 


‘the Ivorian man’ 


c. “fju kodivware -u 
man Ivorian -DEF.1.SG 


‘the Ivorian man’ 
d. * kodivware -u Du 
Ivorian -DEF.1.5G man 


‘the Ivorian man’ 


In (10b) and (10c), we see that the same adjective cannot occur after the noun, 
either preceding or following the determiner. This does not result in a semantic 
alteration of the DP, but in complete ungrammaticality. (10d) shows that such 
adjectives cannot be the host for the determiner clitic. 

Cinque (1994) argues that adjective phrases merge in the Specifier position 
of functional projections occurring in a fixed order above NP based on a strict 
hierarchy, as in (11). 


(11) [rp APF[APF[ APF [ypN]]]] 


The data above is somewhat surprising, as it contradicts the semantic hierar- 
chy for adjectives proposed by Cinque (1994) according to which adjectives of 
nationality merge lower than adjectives of quality (Cinque 1994: 96). One major 
difference here is that Nafara postnominal adjectives are ordered freely, as in (12) 
below. Therefore, as is, the data does not allow us to posit any specific ranking 
of adjectives of nationality relatively to other types of adjectives. 
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(12) a. 


pom tà tfe ne -bel 
apple sweet good red -DEF.1.PL 


‘the sweet good red apples’ 


. pom fe 


pe ta -bel 
apple good red sweet -DEF.1.PL 


‘the sweet good red apples’ 


b. pom tfe tá pe -bel d. pom pe tă fe bel 
apple good sweet red -DEF.1.PL apple red sweet good -DEF.1.PL 


'the sweet good red apples' ‘the sweet good red apples’ 


Due to the fact that both pre- and postnominal adjectives exist in Nafara, and 
because postnominal adjectives can be ordered freely, I follow Cinque (2010) in 
differentiating between two types of adjectives merging in two distinct positions 
in the syntax. While postnominal (low) adjectives merge as adjuncts inside NP 
(which explains their free ordering), prenominal (high) adjectives merge higher 
in the structure. Following Cinque, the latter are in fact reduced relative clauses 
(henceforth RC) merging in the Specifier of a projection above NP. 

Applying this analysis to Nafara, I propose the syntactic representation in (13) 
below, where the few semantically selected reduced RC adjectives in the lan- 
guage merge in the specifier of the extended nominal domain that I call =P fol- 
lowing Aboh (2004).’ 


(13) [pp ... D [Demp -— Dem [Nump ... Num [yp [AnignP] X [np — [np N] 
[AiowP]]]]]] 


Considering the word order [Ahigh N Ajow D Dem Num] in (14), I argue that 
high and low adjectives all remain in situ within XP during the entire derivation.® 


(14) kodivware lo fe tà -gəl gal korfi 
Ivorian mango good sweet -DEF.3.PL DEM.3.PL seven 
Ahigh N Alow Alow -D Dem 


'these seven good sweet Ivorian mangoes' 


Num 


In the following section, I show that only XP moves up the DP spine, gener- 
ating the surface word order. I will illustrate this derivation, and motivate it in 


83. 


7In his account for Gungbe - and ultimately for Gbe languages, Aboh (2004) argues in favor of 
an extended noun phrase that he calls XP. In Gbe (also Niger-Congo), XP constitutes the in- 
flectional nominal domain, and XP alone undergoes movement up the DP spine for agreement 
purposes, landing in Spec,DP. While XP in Nafara does not apparently constitute a particular 
inflectional domain, I posit its existence in the language as the extended noun phrase once the 
higher adjectives have merged. 

*Further data regarding complements to N is called for. The question is left aside and will be 
the object of future research. 
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2.2 Underlying structure and derivation 


In his DP typology, Cinque (2005) argues that all attested word orders are the 
result of NP undergoing some type or combination of movements. According 
to him, total roll-up movement (also referred to as Spec-to-Spec movement with 
pied-piping) is the least marked type of movement available. Regarding the word 
order [N A Dem Num] (which is the one we observe here in Nafara DPs), Cinque 
argues that it is rather rare, but derived as in (15). 


(15) (NA Dem Num[eral]) has a derivation in which NP raises past A, 
followed by pied-piping of the whose picture type past Num[eral], 
followed by raising of [N A] without pied-piping (marked) past Dem. 
(Cinque 2005: 323(61)) 


Cinque argues that what makes this word order rather uncommon is that it 
is somewhat marked. NP undergoes a combination of roll-up and Spec-to-Spec 
movement, as shown in Figure 2, adapted from Cinque (2005). 

In 82.1.5, I have established that low adjectives are adjuncts inside NP, and high 
adjectives merge in Spec,=P, an extended nominal projection. Therefore, NP no 
longer needs to raise past AP and then raise along with AP past Num. Instead, the 
entire XP, including the high and low attributive adjectives, moves up. Figure 3 
is a tree representation of the phrase in (16) where this movement is illustrated. 


(16) kodivware lo del tà -gal gal korfi 


Ivorian mango very sweet -DEF.3.PL DEM.3.PL seven 


'these seven very sweet Ivorian mangoes' 


The syntactic structure in Figure 3 clearly shows that, to obtain the desired 
word order [Apigh N Ajo, D Dem Num], XP must move up without pied-piping 
and eventually reach Spec,DP (to precede D, Dem, and Num in that order). There- 
fore, and contra Cinque in (15) and Figure 2, I argue that XP undergoes a uniform 
Spec-to-Spec movement without pied-piping all the way up. However, this move- 
ment must be correctly motivated. 

In 82.3, I demonstrate that movement is triggered by syntactic agreement with 
N. Due to some agreement effects that I will explain, XP is the phrase that gets 
selected, and raises to Spec,DP by way of Spec-to-Spec movement. 
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AgryP 


No pied-piping: marked Agr, XP 


NumP 


X | AgryP 


AP 


Figure 2: Syntactic derivation for the word order N A Dem Num 
(adapted from Cinque 2005) 
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DP 
[DEF] 
[GEN] 
[NUM] 
D’ 
[DEF] 
D DemP 
BA 
Dem' 
Dem  NumP 
B AM 
Num' 
Num XP 
korfi PA 
A 
| AP Se 
kodivware X NP 
I 
N 
N° AP 
N del ta 
lo 


Figure 3: Feature Percolation in Nafara DPs. 
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2.3 Agree and DP word order 
2.3.1 Agree 


As previously mentioned, number and gender are active features of Nafara nom- 
inal concord. The phrase in (1), repeated here in (17), shows that both the deter- 
miner and the demonstrative agree in number and gender. 


(17 lo tá -gal gal korfi 
mango sweet -DEF.3.PL DEM.3.PL seven 


H A 
these seven sweet mangoes 


Debatably, nominal concord has been argued to be the result of an operation 
called Agree (Chomsky 2000; 2001) also considered responsible for subject-verb 
agreement (Baker 2008; Carstens 2001; Collins 2004, among others). One version 
of the operation is provided in (18). 


(18) Agree (Norris 2014: 26) 
A probe X establishes an Agree relation with a goal YP, where: 


a. X c-commands YP, 
b. X lacks values for uninterpretable features that can be supplied by the 
values of matching features on Y, 

c. Y lacks values for uninterpretable features that can be supplied by X, 

d. No potential goal intervenes between X and Y, 

e. X and Y are in the same phase. 
Agree supplies the values of each category’s uninterpretable features from 
matching features of the other category, with the two features coalescing 
into a single shared feature. 


Let us now see how this operation can account for DP-internal agreement and 
word order in Nafara. 


2.3.2 Agree, EPP, and XP movement 


I argue here that D merges with a set of features that contains at least one valued 
feature: definiteness [DEF], and two unvalued features: number [Num] and gender 
[GEN]. Additionally, D merges with an EPP feature. Similarly, Dem merges with 
unvalued [Num] and [GEN] features, and the numeral arguably merges as the 
head of NumP with a gender feature that is unvalued (that is at least the case for 
‘one’, overtly inflecting for gender as shown in (6)). 
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By way of Agree, all these functional elements look down their c-command 
domain for the missing values. As N comes from the lexicon with both [NUM] 
and [GEN], it is the best possible goal for all of them to probe down to. 

While N is the best potential goal, it is XP that moves up. In fact, the same 
Agree relation through which DP-internal functional elements probe down to N 
selects XP to move up to Spec,DP. In his account for Estonian concord, Norris 
(2014) argues that the DP-internal agreement pattern observed in the language 
is made possible by a syntactic principle called Feature Percolation, presented 
here in (19). 


(19) Feature Percolation (Norris 2014: 135 (242)) 
a. All projections of a head X? have the feature-value pairs that X? has. 


b. Let [F:val] be a valued feature on XP. 
Let Z? be a head lacking the feature [F]. 
Let X° and Z? be members of the same extended projection (i.e., both 
[*N]). 
When Z? merges with XP, projecting ZP, ZP also has the valued feature 
[F:val]. 


According to (19), as heads merge into the structure with valued features, those 
same features percolate to the phrase those heads project. In Figure 4, I show how 
this principle applies in the Nafara DP. By way of Feature Percolation, and due to 
the fact that =P is nominal, I argue that the features [GEN] and [NUM] percolate 
all the way up to =P. 

Since agreement with N occurs all the way up the DP spine, I argue that =P 
undergoes Spec-to-Spec movement up to Spec,DP where the EPP feature on D is 
satisfied. 


3 Conclusions 


In this paper, I have accounted for the structure and word order occurring in 
Nafara DPs. Following Cinque (2005), I argue for D > Dem > Num > A > N to be 
the DP-internal hierarchy. Additionally, and following Cinque (2010), I argue that 
there are two distinct types of attributive adjectives, that I refer to as high and 
low, and that do not seem to be interchangeable in the language. Finally, I argue 
that the surface word order is derived by moving XP to Spec,DP, for agreement 
purposes and due to the presence of an EPP feature on D. Because it is structurally 
higher than XP, Num does not undergo movement. For this reason, it surfaces in 
DP-final position. The Spec-to-Spec movement of =P is illustrated in (20) below. 
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DP 
[DEF] 
[GEN] 
[NUM] 
D 
[DEF] 
D DemP 
[DEF] 
[uGEN] I 
[uNUM] Dem’ 
EPP Pat 
Dem NumP 
[uGEN] 
[uNUM] FN 
Num’ 
Num =P 
[uGEN] [GEN] 
[NUM] 
AP E 
[GEN] 
PR ee 
+ Ahigh.…. Pa 
= NP 
[GEN] 
[NUM] 
| 
N 
[GEN] 
[NUM] 
N’ AP 
[GEN] 
T ANN 
| low. 
N 
[GEN] 
[NUM] 


Figure 4: Feature Percolation in Nafara DPs 
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rem] 


(20) [or Deer eent, Dem Lune, Nom [zp Lap Anigh] X [up N [Ap Ass ll ] 


Abbreviations 

A(dj) adjective D determiner Num numeral 
1 Sender) DEF definite PL plural 
2 gender2 Dem demonstrative SG singular 
3 gender3 INDEF indefinite 
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Chapter 9 


Complement clause C-agreement 
beyond subject phi-agreement in 
Ikalanga 


Rose Letsholo 


University of Botswana 


Ken Safir 
Rutgers University 


In this essay, we provide a rich description and analysis of C-agreement and com- 
plement clause complementizer (CCC) distributions in Ikalanga which reveals a 
highly articulated set of relationships between matrix clause properties and the 
morphology of an agreeing complementizer. In particular, we show that the agree- 
ing CCC is sensitive not only to matrix subject phi-features, but to matrix voice and 
tense as well. Although the agreeing Cs are limited to a small set of predicates that 
are not fully predictable from meaning, the agreement facts are the same wherever 
an agreeing C is possible. 


1 Introduction 


Complementizer agreement with matrix verb subjects in languages like Lubu- 
kusu (Diercks 2013) raises many issues concerning the syntax of agreement and 
the status of agreeing complementizers, but rich descriptions beyond Lubukusu 
are rare. In addition to Lubukusu, C-agreement with the matrix subject has been 
attested for Kinande and Ibibio (Baker 2008), Mande languages (Idiatov 2010), 
Luvale, Luchazi, Chokwe and Lunda (Kawasha 2007), Limbum (Nformi 2017) and 
Kipsigis (Diercks & Rao 2019). Only the recent account of Kipsigis, however, 
comes close to the thorough description Diercks provides for Lubukusu. In this 
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paper we provide a rich description and analysis of C-agreement and comple- 
ment clause complementizer (CCC) distributions in Ikalanga, which uncovers 
a much more articulated set of relationships between matrix clause properties 
and the morphology of an agreeing complementizer. In particular, we show that 
the agreeing CCC is sensitive not only to matrix subject phi-features, but ma- 
trix voice and tense as well. Although we suggest an analysis, our main goal in 
this essay is to set boundary conditions on what any truly explanatory treatment 
must account for. 


2 The Ikalanga pattern 


Ikalanga CCCs for non-infinitival clauses include invariant kuti and kuyi, and 
two CCCs that agree with matrix subjects, AGR-ti and AGR-yi. The morpheme 
-ti is identical to the verb ‘say’ -ti and neither AGR-ti, nor any other CCC, is 
possible with main verb -ti. However, AGR-ti, when it occurs with other verbs, 
not only agrees with matrix subjects, but also partially agrees with matrix tense. 
We will argue further that both kuti and kuyi are also based on the -ti ‘say’ root 
which has the suppletive form -yi when it is passive. Kuyi, in at least some of its 
distribution, agrees for passive voice with the matrix verb. The ku- of kuti and 
kuyi is a default form where the c17 prefix ku- affixes to -ti (and suppletive -yi) 
in the absence of agreement. We argue that all the -ti-based complementizers 
are essentially the same form and that they are not verbs - grammaticalization 
is incomplete in an interesting way. This pattern has consequences for what the 
assumed locality relations are between matrix T(ense) and elements inside vP. 
Although sensitivity to voice and subject phi-features is feasible within the vP- 
phase in minimalist reasoning, interaction of a CCC with matrix T raises inter- 
esting questions that require further assumptions which we will explore. 

The following examples show that what we are calling the AGR-ti comple- 
mentizer agrees with the matrix subject for person and noun class. Examples 
(1a-1b) show agreement for noun classes c1 and c2 and examples (1c-1d) show 
agreement for noun class and person. The vowel /e/ in eti (1a) and beti in (1b) is 
underlyingly /a/ before regressive assimilation. The rest of the examples in (1) 
illustrate agreement for c3 through c10. The presence of -ka- as in -kati will be 
addressed later.! 


‘Our glosses diverge slightly from Leipzig Glossing Conventions in that we follow Afranaph 
Project glossing (Afranaph Project Ongoing) and in particular subject agreement is treated 
as subject marker (SM) to avoid prejudging the agreement or pronominal status of subjects in 
Bantu languages and more generally (also om for object marker). Noun class numbers are pre- 
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(1) a. ü-n6-dw-à Néó è-ti a-téng-é lori 

SM.c1-PRS-tell-fv Neo sm.cl-that sm.cl-buy-sBJv c9.car 
‘He/she is telling Neo that she should buy a car/to buy a car’ 

b. íbó — b-á-dw-á Nchídzi bé-tí á-tèng-è lórì 
PRN.C2 SM.c2-psT1-tell-fv Nchidzi sM.c2-that sm.cl-buy-sBJv c9.car 
‘They (pl) told Nchidzi that he should buy a car/to buy a car: 

c. nd-ó-dw-à Néé ndi-tí a-teng-é lori 
sM.c1.15'-prs-tell-fv Neo sm.c1.15-that sm.cl-buy-sByv c9.car 


‘I am telling Neo to buy a car’ 


d. iswi t-a-dw-a Nchidzi ti-ti a-téng-é 
PRN.C2.1*! sm.c2.15'-pst1-tell-fv Nchidzi sm.c2.1t-that sm.cl-buy-sBJV 
lori 
c9.car 
"We told Nchidzi that he should buy a car/to buy a car: 

e. mpání w-áká-dw-à nsü ü-kàti u-izel-é 


mophane.c3 sM.c3-PsT2-tell-fv acacia-c3 sM.c3-that sM.c3-sleep-sBJv 
"The mophane tree told the acacia tree to sleep: 

f. mipání y-áká-dw-à nsü i-kati u-izel-é 
mophane.c4 sM.c4-PsT2-tell-fv acacia-c3 sM.c4-that sM.c3-sleep-sBJv 


"The mophane trees told the acacia tree to sleep’ (lit. The mophane 
trees told the acacia tree that it (the acacia tree) should sleep.) 


Additional examples illustrating agreement for c5-c10 are to be found in Afra- 
naph Project (Ongoing), see example IDs #15250-15255. 


2.44 The Lubukusu pattern 


Our expectations of what C-agreement with matrix subject phenomena look like 
have been largely set by Diercks' (2013) analysis of Lubukusu, and so we outline 
some key features of the Lubukusu pattern as a background comparison for our 


zm 


ceded by “c” to facilitate Afranaph searches, as are other departures from Leipzig conventions. 
2 and noun class glosses determine plurality, as in sm.c1.1* 
which is first person singular where c2 would be plural. Third person is treated as default. 
Please see the list of abbreviations on page 184 for other glosses. Associative plural, bo- in 
Ikalanga, is informally glossed as “&”. nF (infinitive) is used instead of c17 in Ikalanga, which 
it may be identical to. For Kinande, rv is initial vowel. 


Person is marked with 1* and 
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account of Ikalanga. Lubukusu has three sorts of complementizers that are typ- 
ically used for indicative complement clauses: invariant bali, mbo, and AGR-li, 
the last of which agrees with the matrix subject. 


A. The root -li of AGR-liis related either to a copula or a c5 agreement marker. 
Indicative clausal subjects show c5 agreement with matrix subject agree- 
ment markers (sms). 


B. Afairly wide class of verbs (at least 19), including verbs of speaking, desire, 
perception and epistemic verbs, take indicative or subjunctive complement 
clauses that can have AGR-li. 


C. Lubukusu AGR-li agrees with the matrix subject of the verb selecting the 
complement clause, whether the matrix subject is an active agent or expe- 
riencer, or if it is the passivized object. 


D. Direct objects of double complement verbs do not block C-agreement with 
the matrix subject and the agreement is not keyed to perspectives. Diercks 
(2013) shows C-agreement with the matrix subject, not the information 
source in prepositional object position, as in (2) (glossing from source). 


(2 Khw-a-ulila khukhwama khu Sammy khu-li/*ali ba-limi 
IPLS-PST-hear from LOC 1Sammy 1PL-that — 2-farmers 
ba-a-funa ka-ma-indi. 
2s-PsT-harvest 6-6-maize 


"We heard from Sammy that the farmers harvested the maize: 


2.2 The role of the ‘say’ root in Ikalanga -ti based complementizers 


One difference from Lubukusu is that the lexical root of the Ikalanga agreeing 
CCC is morphologically identical to the root of a verb meaning ‘say’. When -ti is 
used as a matrix verb, no -ti complementizer is possible, be it kuti, kuyi or AGR-ti. 


(3) a. nd-à-tí Néé w-á-téng-à lóri 
sM.c1.1'-psti-say Neo sM.c1-PsT1-buy-fv c9.car. 
‘I said that Neo has bought a car? 
b. ù-kà-ti Nchídzi w-á-énd-à 
SM.c1.2nd-Pns-say Nchidzi sM.c1-PST1-go-fv 


"You say/are saying Nchidzi is gone/has left: 
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c. a-ka-ti Nchídzi w-á-énd-à 
SM.c1-PRs-say Nchidzi sM.c1-PsT-go-fv 
‘He says/is saying Nchidzi is gone/has left: 

d. Néó ó-nóó-ti b-àná  bá-zh-é. 
Neo sM.c1-FUT-say c2-child sM.c2-come-sBjv 
‘Neo will say (that) the children should come’ 


Since many languages have grammaticalized ‘say’ complementizers (see, e.g., 
Heine & Kuteva 2002), a natural hypothesis might be that AGR-ti not only looks 
like a verb, it is one, and that is why it is missing when the main verb means ‘say’. 
There are a number of reasons that we do not analyze AGR-ti as a synchronic 
verb. 

One is that the morphology borne by AGR-ti is impoverished by compari- 
son with main verb -ti. For example, recent past (PST1) of main verb -ti (3a), the 
present (3b), and the future (3d), are never used on the ‘AGR-ti’ that introduces a 
complement clause (whenever the main verb is not -ti). The suppletive -kà- which 
is prs for only the verb -ti in (3b) can occur on AGR-ti, but only in concordance 
with remote past.? 

Additional evidence that AGR-ti is not acting as a verb comes from contexts 
where the verb -ti is embedded after AGR-ti. [based on Afranaph Project Ongo- 
ing: ID559] 


(4) Edgar w-ákà-dw-à Bill à-kà-ti á-tí Mary 
Edgar sM.c1-PsT2-tell-fv Bill sm.c1-PST2-that sM.cl-say Mary 
u-noo-n-da 
SM.c1-PRS-OM.cl-love 
‘Edgar told Bill to say that Mary loved him: 


If àkáti were sufficient to embed the verb ‘say’, then we would not expect ati 
to follow it. In fact, there is another verb meaning ‘say’ or ‘speak’ in Ikalanga 
that requires AGR-ti as its complementizer. If AGR-ti were a main verb it would 
be redundant in (5). 


(5) Néó w-ákà-leb-a (à)-kà-ti Nchidzi w-ákà-téng-á lóri 
Neo sM.c1-PST2-say-fv sM.c1-PST2-that Nchidzi sM.c1-PST2-buy-fv car 
‘Neo said it that Nchidzi had bought a car. 


"Remote past (psT2) might be morphologically composed of recent past (PsT1) -a- and remote 
past (PST2) -ka- but we will represent it as undecomposed -aka-. This morphological analysis 
is not crucial in our discussion of C distributions. 
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Moreover, all matrix verbs that take subjunctive or indicative complements 
require that the CCC be either AGR-ti, kuti, or kuyi, an observation most natu- 
rally stated as (6) rather than a semantically inappropriate claim that every finite 
complement clause is introduced by the verb meaning ‘say’. 

(6) In Ikalanga, C is obligatory for any non-infinitival complement clause, 
unless the main verb is -ti. 


We conclude that -ti-based C in Ikalanga is not a verb, however closely it is 
historically related to the verb -ti, and -ti-based complementizers are fully dis- 
tinguishable from the verb -ti. We return to the relationship between -ti-based C 
and the verb -ti in §3.4, however. 


2.3 Sensitivity of AGR-ti to Tense 


Ikalanga departs from what we know of the other C-agreement languages in 
that AGR-ti is sensitive to the tense of the matrix verb. However, as just re- 
marked, AGR-ti morphology is often impoverished; Tense agreement on AGR-ti 
only matches the matrix verb for remote past -aká-. For any other matrix tense 
there is no tense affix on AGR-ti. Where -ti cannot bear PsT2, the non-agreeing 
kuti is used. 


(7) a Maria w-ákà-dw-a Nchídzi (à)-kà-ti á-bík-é 
Maria sM.cl1-PsT2-tell-fv Nchidzi sM.cl-psT2-that sM.cl-cook-sBJv 
‘Maria told Nchidzi that he should cook’ 


b. bo-Maria b-ákà-dw-à Nchídzi bá-kà-ti 
&-Maria sM.c2-PsT2-tell-fv Nchidzi sm.c2-psT2-that 
á-bík-é 


SM.c1-cook-sBJv 
‘Maria and others told Nchidzi that he should cook’ 


There are some verbs that take a slightly different -ti-based ‘as that’ complementizer (similar 
to the one found in Kinande) which we do not discuss. 
(i) bo-Nchidzi ba-no-zwi-tham-a se-u-nga-ti ba-no-lil-a 
&-Nchidzi sM.c2-PRS-RFM-make-fv as-sM.cl-as-that sM.c2-PRS-cry-fv 


‘Nchidzi and others are pretending to be crying: 
Or: ‘Nchidzi and others are pretending like they are crying" 
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(8) 


a. 


b. 


C. 


Né6 w-a-zwi-buzw-a (a)-ka-ti a Nchidzi 
Neo sM.c1-PST1-RFM-ask-fv SM.c1-PST2-that Q-part Nchidzi 
w-ákà-téng-á lóri 

SM.c1-PST2-buy-fv car 

‘Neo wondered whether Nchidzi had bought a car. 

* Néó u-no-zwi-buzw-a (a)-ka-ti a Nchídzi 
Neo sM-c1-PRS-RFM-ask-fv SM.c1-PST2-that Q-part Nchidzi 
w-a-ka-téng-a lori 
SM.c1-PST2-buy car 
‘Neo wonders whether Nchidzi had bought a car. 

Néó á-nóó-zwí-büzw-à kuti à Nchídzi w-ákà-téng-á 
Neo sM.c1-FUT-RFM-ask-fv that Q-part Nchidzi sM.c1-PsT2-buy-fv 
lóri tshwá 

car new 


‘Neo will wonder whether Nchidzi bought a new car: 


Thus in addition to the phi-features of the matrix subject, the shape of AGR-ti 
is also sensitive to tense. 


2.4 Sensitivity to Voice 


When a verb taking a clausal complement is passivized, the form of the CCC 
is typically kuyi, which does not permit any subject or tense agreement (9b is 
after ID654 in Afranaph), but some speakers (probably older ones) also accept 
agreeing AGR-yi (which we have verified allows agreement for all persons and 
noun classes, not illustrated here). 


(9) 


a. 


(imi) nd-àká-dw-à Nchídzi ndí-kà-ti á-téng-é 

I sm.ct.18t-psr2-tell-fyv Nchidzi sm.c1.1t-psT2-that sm.cl-buy-sByv 
lori. 

car 

‘I told Nchidzi that he should buy a car’ 

(imi) nd-aka-dw-iw-a kü-yí Mary 


I sm.ct.1t-psr2-tell-pass-fv sm.c1.15'-that Mary 
à-á-tó-ndí-d-á 

NEG-SM.c1-PST1-OM.c1.15t-like-fv 

‘I was told that Mary did not like me: 
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imi nd-aka-dw-iw-a ndi-yi Mary 
I sm.ci.18'-pst2-tell-pass-fv sm.c1.15'-that Mary 
a-a-td-ndi-d-a 

NEG-SM.c1-PST1-OM.c1.1t-like-fv 

‘I was told that Mary did not like me’ 


Even verbs like -budz- that allow both kuti and AGR-ti in the active favor the 
-yi form when the immediately superordinate verb is passivized, though kuti is 
always possible, as in (10a). Verbs that do not allow AGR-ti like -dum- also can 
take kuyi (but not AGR-yi) when they are passivized - compare (10c, 10d). 


(10) 


a. 


Néó ü-nd-èmül-à kü-büdz-íw-á  küyi/küti 4-tèng-é léri 
Neo sM.c1-Prs-wish-fv 1nF-tell-pass-fv that SM.cl-buy-sBJv car 

‘Neo wishes to be told to buy a car: 

Néó ü-nd-èmül-à kü-büdz-íw-á  é-yí á-téng-é lóri 
Neo sM.cl-PRs-wish-fv INF-tell-pass-fv sM.cl-that sM.cl-buy-sBJv car 


‘Neo wishes to be told to buy a car? 


Né6 ü-nd-düm-à kuyi/kuti Nchídzi w-ákà-bál-á 

Neo sM.c1-PRs-believe-fv that Nchidzi sM.c1-PsT2-read-fv 
bükà 

book.c9 

‘Neo believes that Nchidzi read a book: 

ku-ó-düm-iw-à küyí/küti Nchídzi w-ákà-bál-á 
SM.cC17-PRs-believe-PAss-fv that Nchidzi sM.c1-PsT2-read-fv 
bükà 

book.c9 


‘It is believed that Nchidzi read a book"? 


A number of points emerge in this data. First, both kuti and kuyi Cs are possi- 
ble for passivized verbs and for some speakers AGR- yi is also possible. However, 
AGR- yi is not normally possible when the matrix verb is not passivized. Also, no- 
tice that the forms like (10b) suggest that agreement for voice and agreement with 
the 'subject' are both distinct from tense agreement. (10b) shows c1 agreement 
on AGR-yi even though the infinitive has no subject agreement — it is marked 
only with the invariant c17 marker for infinitives. Thus the C must be agreeing 


“The sM.c17 appears to be ku- where /u/ becomes a glide before /a/ and deletes before /o/. Noth- 
ing turns on this. 
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with the PRO subject of the ‘to be told’ infinitive, determined to be c1 because 
it is controlled by Neo. This fact generalizes to AGR-ti, which also can agree for 
noun class with a controlled infinitival subject. 


(11) Néó wa-ka-bé-é-shak-a kü-dw-á  bàisáná é-ti 
Neo sM.c1-PST2-AUx-SM.c1-want-fv INF-tell-fv c2.boys sM.c1-that 
ba-téng-é lori 
SM.c2-buy-sBJv car 
‘Neo wanted to tell the boys that they should buy a car’ 


Subject clauses always take kuti. When an active verb like -sup-, ‘prove’, that 
takes only kuti in its complement clause is passivized, it can take either kuti or 
kuyi in its postverbal clausal complement. However, when the passivized clause 
is in sentential subject position (as marked by subject agreement on the verb), 
the C must be kuti, as in (12c), and it cannot be kuyi. 


(12) a. nsékísi W-ákà-süp-à küti Nchídzi w-ákà-kwíb-á 
prosecutor.1 sM.c1-PsT2-prove-fv that Nchidzi sM.c1-PST2-steal-fv 
márí 
c6.money 


"The prosecutor proved that Nchidzi stole the money: 


b. ku-ákà-sáp-íw-à küti/küyí Nchídzi w-ákà-kwíb-á 
c17-PST2-prove-PAss-fv that Nchidzi sM.c1-PsT2-steal-fv 
márí 
c6.money 


‘It was proved that Nchidzi stole money: 


c. küti Nchídzi w-ákà-kwíb-á márí kw-ákà-sáp-Íw-à né 
That Nchidzi sM.c1-PST2-steal-fv c6.money c17-PsT2-prove-PAss-fv by 
nsèkisi 
prosecutor 


‘That Nchidzi stole the money was proved by the prosecutor. 


This shows that the distribution of kuyi and AGR-ti is sensitive to voice. It 
suggests that kuti is necessary in subject position because passive does not c- 
command it. Notice further, that -sup- is a verb that does not take AGR-ti. Thus, 
even verbs that only co-occur with kuti in their active form also license kuyi. 
This shows that the kuti/kuyi alternation is independent of the kuti/AGR-ti alter- 
nation. The agreement and distributions of -ti and -yi are summarized in Tables 1 
and 2, respectively. 
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Table 1: Summary of the C-agreement morphemes 


Agreement features -ti -yi 
Subject Agr V v 
Tense Agr V v 


Table 2: Complementizer distribution of -ti in comparison with -yi 


Distribution -ti -yi 


Voice: Active 
Passive 


Mood: Subjunctive 
Interrogative 


S SN NS 
<x Se S SN X 


Introducing clausal complement 


2.4.1 Suppletion of -ti 


What makes it clear that there is an agreement relation between passive voice and 
kuyi or AGR-yi, however is the -yi shape itself. When main verb -ti is passivized, 
the root is suppletive, taking the form -yi, as illustrated in (13b), and unlike the 
AGR- yi C, it can be inflected for tense. 


(13) a. ku-aka-yi Nchídzi à-á-pó 
SM.c17-PST2-say.PASS Nchidzi Neg-sM.c1-there 
‘It was said that Nchidzi was not there’ 
b. kó-ó-yi Nchídzi á-téng-é lóri tshwá 
SM.c17-FUT-say.PASS Nchidzi sM.c1-buy-sBjv car new 
‘It will be said that Nchdizi should buy a new car: 


Thus it seems that AGR-yi could be a form of suppletion parallel to the -ti/-yi 
alternation of the main verb -ti. It is plausible to see this as a form of morpho- 
logical concord or agreement, especially since phi-agreement on AGR-yi is also 
possible when it is a complementizer (as in 14). Moreover, AGR- yi, just like AGR- 
ti, also permits tense agreement for PST2. 
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(14) Néó w-ákà-dw-iw-á á-ká-yi á-téng-é lóri 
Neo sM.c1-PsT2-tell-PAss-fv sM.c1-PsT2-that sM.c1-buy-SBJv car 


‘Neo was told to buy a car/Neo was told that she should buy a car’ 


It appears that verbs that most favor AGR-ti are those that permit AGR- yi, but 
we have not checked every case. 


2.4.2 Kuyi without passive concord 


Kuyi can be used for emotive complement clauses in the absence of passive mor- 
phology, but in these cases, it appears to simply be an infinitival passive comple- 
ment of the matrix verb. Thus no complementizer preceding infinitival kuyi is 
expected. Kuyi is functioning as a main verb, and as such, it is the one verb that 
does not introduce a C before its complement clause. Consistent with the infini- 
tival passive analysis, AGR- yi is not possible for any of (15a-15d) as illustrated 
by (15e). 


(15 a. Néó ó-nóó-chénám-à kü-yí á-teng-é lóri 
Neo sM.c1-FUT-surprised-fv INF-say.PASS SM.cl-buy-sBJv car 
‘Neo will be surprised to be told that she should buy a car: 
b. Néó ó-nóó-gwádzík-à kü-yí á-teng-é lóri 
Neo sM.cl-FuT-hurt-fv INF-say.PASS SM.cl-buy-sBJv car 
‘Neo will be hurt to be told that she should buy a car? 
c. Néó ó-nóó-sháth-á kü-yí á-téng-é lóri 
Neo sM.c1-FUT-happy-fv INF-say.PASS SM.cl-buy-sBJV car 
‘Neo will be happy to be told that she should buy a car: 
d. Néó ú-nòò-d-á kü-yí á-téng-é lóri 
Neo sM.cl-FuT-happy-fv INF-say.PASS SM.cl-buy-sBJv car 
‘Neo will be happy to be told that she should buy a car: 


e. * Néó t-noo-chénam-a è-yi a-teng-é lori 


Neo sM.c1-FUT-surprise-fv SM.c1-say.PASS SM.cl-buy-sBJV car 
‘Neo will be surprised to be told that she should buy a car’ 


Other cases that may be instances where kuyi acts like a passivized infinitival 
complement may include instances where it functions as an evidential. 


(16) Néó w-ákà-wh-á küti/küyí Nchídzi w-ákà-téng-á lóri tshwá 
Neo sM.c1-PST2-hear-fv that Nchidzi sM.c1-PsT2-buy-fv car new 
‘Neo heard that Nchidzi had bought a new car: 
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In these contexts, it appears that the truth of the proposition introduced by 
kuyi may have some sort of evidential import. Verbs like wh- ‘hear’ can also take 
kuti, as in (16), but when kuyi is used, the source of the information is evaluated 
differently. While (16) with kuti does not indicate how the information came to 
the matrix subject hearer (someone could have made this claim directly to the 
matrix subject), with kuyi it indicates that that the matrix subject heard that it 
was said that Nchidzi had bought a car. This indicates further distance between 
the matrix subject and the evidence.” 

At present, it is not clear that the infinitival passive complement analysis is 
appropriate for (17a-17b), that is, whether the agreement is about what is said or 
about what is believed. These examples require further study. 


(17 a. Néó w-4kà-düm-à küyí Nchídzi w-ákà-téng-á lóri tshwá 
Neo sM.cl-pstT2-agree-fv that Nchidzi sM.cl-pst2-buy-fv car new 
‘Neo agreed that Nchidzi had bought a new car. 
b. ingwi m-ó-lándül-à küyí Nchídzi w-ákà-téng-á lóri tshwá 
You.pl sM.c2-Pns-refute-fv that Nchidzi sM.c1-PsT2-buy-fv car new 
"You (PL) refute that Nchidzi has bought a new car? 


Apart from instances where kuyi is a passivized infinitive (and cases like 17a- 
17b), however, the concord approach to the appearance of kuyi in place of AGR- 
ti still seems like the best generalization, especially given the sentential subject 
facts. The generalization in (18) seems to capture the central pattern. 


(18) The -ti-based complementizers are allomorphs. 


In addition to the fact that all of these -ti/-yi-based forms are almost in comple- 
mentary distribution (if the optionality of (10a), (10d) and (12b) arise from com- 
peting analyses), they also all occur in the same high clausal position, above 
the question particle that can introduce a matrix yes-no question as well as an 
indirect yes-no question. The verb -buzw- can take either AGR-ti (as in 8a) or 
invariant kuti. The presence of the Q-particle, also used in matrix questions, is 
obligatory. 


*Moreover, it is possible for a source phrase to be licensed for (16), though it is not clear whether 
it is kuyi or ‘hear’ itself that makes the source phrase possible, since kuti is also possible here. 


Gi)  Néó w-ákà-wh-á küyí Nchídzi w-ákà-téng-á lóri ndí John 
Neo sM.cl-PsT2-hear-fv that Nchidzi sm.cl-pst2-buy-fv car by John 
‘Neo heard from John that Nchidzi had bought a car? 
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(19) a. Néó w-aka-zwi-buzw-a  (à)-kà-ti a Nchídzi 
Neo sM.c1-PST2-RFM-ask-fv sM.cl-psT2-that Q-part Nchidzi 
w-ákà-téng-á lóri tshwa 
sM.cl-psT2-buy-fv car new 
‘Neo wondered whether Nchidzi had bought a new car’ 

b. Néó w-aka-zwi-buzw-a  küti à Nchidzi w-aka-téng-a 
Neo sM.c1-PST2-RFM-ask-fv that Q-part Nchidzi sM.c1-PsT2-buy-fv 
lori tshwa 
car new 
‘Neo wondered whether Nchidzi had bought a new car’ 

c. "Néó w-aka-zwi-buzw-a kon Nchidzi w-aka-téng-a Jon tshwa 
Neo sM.cl-PsT2-RFM-ask-fv that Nchidzi cl-pst2-buy-fv car new 
‘Neo wondered whether Nchidzi had bought a car? 

d. Néó w-ákà-bázw-iw-á küyíà Nchídzi w-ákà-téng-á 
Neo sM.cl-PsT2-ask-PAss-fv that Q-part Nchidzi sM.c1-Psr2-buy-fv 
lóri 
car 


‘Neo was asked whether Nchidzi had bought a car: 


2.5 More on the morphology of AGR-ti 


There are some deformations in the shape of AGR-ti that do not simply result 
from morphologically composing the sm, the (remote past) tense, and -ti. Al- 
though we alert the reader to them here, they do not change the basic gener- 
alization, namely, that the possible forms of AGR-ti (and AGR- yi) are entirely 
predictable from the sm and tense of the matrix verb. 

We have shown that remote past -(a)ká- shows up on AGR-ti when it also 
appears on the matrix verb. However, if the matrix verb is negated then the shape 
of matrix remote past tense is affected, surfacing as zo- instead of -ká. In this case, 
ká- can still appear on AGR-ti, as in (20a). 


(20) a. Né à-á-zó-dw-à Nchídzi á-kà-ti 
Neo NEG-SM.c1-NEG.PST-tell-fv Nchidzi sM.c1-Psr2-that 
â-tèng-è lori 
sM.cl-buy-sBJV car 
‘Neo did not tell Nchidzi that he should buy a car? 
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b. Néó à-á-zó-dw-à Nchídzi è-ti a-téng-é lori 
Neo NEG-SM.cI-NEG.PST-tell-fv Nchidzi sM.c1-that sM.cl-buy-sBJv car 
‘Neo did not tell Nchidzi that he should buy a car: 


c. (iwé) à-à-zó-dw-à Nchídzi on 
You wEG-sM.cl.2?3-Nrc.PsT-tell-fv Nchidzi sm.c1.2”4-that 
á-téng-é lóri 


SM.cl-buy-sBJv car 
"You did not tell Nchidzi that he should buy a car’ 


In (20b) where ka- optionally does not appear, sM a- will regressively assimilate 
raising to front e- (we do not know why the /a/ of ka- does not undergo regressive 
assimilation before the /i/ of kati). The high u- of sm.c1.2"4 would not be affected, 
as in (20c). 

The paradigm for the sm followed by the complementizer -ti is presented in 
Table 3 (the tone of -ti is influenced by its context). 


Table 3: sm paradigm 


SM.cl3!'d sm.c1it sm.c1.2Ÿ ~~ sm.c2.3% sm.c2.15t — sm.c2.2"4 


n ` H 


u/wa/a ndi ü bá ü mü 


It is not clear whether or not sM.cl, which is 31d if unmarked, is wá- when 
it precedes pst configuration or if it is just /ú/ fused with the per -a- that fol- 
lows it (as suggested in Letsholo 2002), raising the tone on /a/. For subjunctive, 
interrogative and negated sentences, c1 C-agreement appears to have the form 
a-. 

The paradigm for sm-ti combinations when there is no Psr2 matching with 
the AGR-ti is as in Table 4. When the matrix has sM.c2, the /a/ of ba- regressively 
assimilates to /e/ before the /i/ of -ti, as does sM.c1 when it is a-. 

If ka occurs on AGR-ti matching matrix PST2, the outputs in Table 5 are possi- 
ble (note that the sm we are treating as a- is the only one that is optional before 
-ka). 

Where ka cannot appear, the corresponding forms in Table 2 are possible. We 
assume that hati is a suppletive option. 

We stress, however, that the morphological details of the forms of AGR-ti do 
not obscure the main point that interests us in this essay, namely, that the possi- 


See also Chebanne (2010: 73). For more complete accounts of subject agreement and their re- 
lations to tense, see Mathangwane (1999) and Letsholo (2002). 
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Table 4: sM-ti paradigm without PsT2 


sM.cl-ti sm.cl.18'-ti. sM.cL294-ti sM.c2-ti sm.c2.15t-ti  sm.c2.24-ti 


u/é-ti ndi-ti u-ti bè-ti ti-ti mu-ti 


Table 5: sM-PST2-ti paradigm 


SM.cl3!d sm.c1.15t SM.cl2*?d  sw.c2.3!d = sm.c2.18t sm.c2.2d 
-PST2-ti -PST2-ti -PST2-ti -PST2-ti -PST2-li -PST2-fi 


(á)-kà-ti hàtì/ndì-kà-tì ù-kà-tì bá-kà-tì tí-kà-tì mú-kà-tì 


ble forms of AGR-ti are entirely determined by its relation to matrix tense, phi- 
features and voice.” 


"There are one or two deviations from the facts as described that we do not understand, such 
as the following example, where ka appears in AGR-ti when we do not expect it to. 


(i)  (iwé) ù-nó-léb-á ù-kà-tì Nchídzi w-ákà-téng-á lóri 
you sM.cl.2"3-pns-say-fv sM.c1.2'*-pns-that Nchidzi sM.c1-PST2-buy-fv car 


"You are saying that Nchidzi bought a car? 


It is possible that this is an instance where the suppletive kati form for main verb PRS-ti 
is echoed on AGR-ti, in which case we also see tense concord for PRS, at least in these cases. 
We have no more to say about such examples, though they deserve more research. It has been 
suggested to us that -ka- could be understood as a consecutive marker of some kind, both in 
its position on the verb stem and its position on the CCC. It is also the case in Ikalanga that ka- 
can be a consecutive marker as in other Bantu languages, as illustrated in the examples below 
(tones omitted): 


(ii) Neo w-aka-tem-a miti, ka kubunganya maswazwi, ka  a-pisa 
Neo SM.c1-PST2-cut-fv c4.tree CONS gather c6.branches CONS oM.c6-burn 


‘Neo cut the trees, gathered together the branches and burned them: 


(iii) Ingwi |^ m-aka-tem-a miti, mu-ka kubunganya maswazwi, 
PRNC2.2™ sm.c2.2"4-pst2-cut-fv c4.tree SM.c2-CONS gather c6.branches 
mu-ka-a-pis-a 
SM.c2.2"-coNs-oM.c6-burn-fv 


"The men cut the trees, gathered together the branches and burned them: 


We are not sure if the events in (i) can be said to be consecutive in nature in the way that 
examples (ii) and (iii) are. We do not see a way to relate these observations to ka on C. 
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2.6 The paucity of AGR-ti-taking verbs vs. invariant kuti 


It would seem that selection for AGR-ti is a largely lexical affair limited to only 
a few verbs, including the roots -buzw- ‘ask’, and -leb- ‘speak, say’, -budz- ‘tell’ 
(21c) and -dw- ‘tell’ (an instruction), which have all been exemplified, in addition 
to -dum- ‘concede/agree/believe’, -landul- ‘disagree’, and -alakan- ‘think’ (not 
illustrated here). By comparison, recall that Diercks cites at least 19 Lubukusu 
verbs which permit AGR-Ii. 


(21 a. Né w-ákà-düm-à (à)-kà-ti Nchídzi 

Neo sM.c1-PST2-agree/concede-fv sM.c1-PsT2-that Nchidzi 
w-aka-téng-a lori 
SM.c1-PST2-buy-fv car 
‘Neo conceded that Nchidzi had bought a car. 

b. Néó w-ákà-lándül-à (à)-kà-ti Nchídzi 
Neo sM.c1-PST2-disagree-fv sM.c1-PST2-that Nchidzi 
à-á-zó-téng-á lóri 
NEG-SM.cl1-PRS-buy-fv car 
‘Neo disagreed (saying?) that Nchidzi has not bought a new car’ 

c. nd-àká-kü-büdz-à ndi-tí u-teng-é lori 
sm.c1.15t-psT2-0m.c1.2"d-tell-fv sm.c1.1%-that sm.cl.2"4-buy-sByv car 


‘I told you to buy a car? 


Consultants differ as to whether or not -dw- can also take kuti complements, 
but they agree that -leb-, -budz-, and -buzw- can (see 19a-19b for buzw-). 


(22) a Néó w-ákà-léb-á küti Nchídzi w-ákà-téng-á lóri tshwá 
Neo sM.c1-PsT2-tell-fv that Nchidzi sM.c1-PST2-buy-fv car new 
‘Neo said that Nchidzi bought a car’ 


b. (imi) nd-àká-ká-büdz-à küti/hà-ti 
I sM.c1.15'-psT2-om.cl.2"4-tell-fy that/sm.c1.15t-that 
u-teng-é lori 
sM.cl.2"¢-buy-sByv car 


‘I told you to buy a car? 


However, as we saw in the case of (12a-12b), whatever determines the distri- 
bution of kuyi concord applies more generally than whatever determines which 
verbs allow AGR-ti. 
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So far, we have no indication that there is a successful generalization about 
verbs that take kuti, other than that kuti may be a default for verbs that take 
indicative or subjunctive complements when AGR-ti is not available. In some 
cases it is still available even when AGR-ti is a possible choice. Evidence that kuti 
is, at least in some contexts, a default form, is that it is always used for sentential 
subjects, where the clause determines sM.c17, even for a verb like -dum- that we 
know to be an AGR-ti-taking verb (in contrast to -sup- in 12b). 


(23) kütibàthü  bà-njinji à-bá-tó-thóph-à 
that c2.people c2.Agr-many NEG-SM.c2-NEG.PRS-vote-fv 
kw-áká-dümí-gw-àn-à 
SM.c17-PST2-agree-PASS-RCM-fV 


"That many people don’t vote was agreed on! 


As noted earlier, this suggests that in clausal subject position, kuti is in a po- 
sition where there is no c-commanding tense, voice, or subject to agree with. 


2.7 The status of the allomorphy hypothesis 


We suggested in (18) that all -ti-based Cs are allomorphs of each other, and at 
this point, there is reason to believe that the distribution of AGR-ti, AGR-yi, and 
kuti is predictable from the voice, tense and subject phi-features of the matrix 
verb, once we determine which predicates permit AGR-ti. We have treated kuyi 
as potentially in concord with passivized verbs in some contexts, but in others 
where it appears to be evidential, we treat it as the infinitival passive form of 
AGR-ti. 


3 Theoretical questions about locality 


If syntactic relations like agreement are always phase-internal, a central tenet in 
minimalist theorizing, then we must determine whether all the agreement rela- 
tions we posit are phase-local. In typical phase-based accounts, C and v or Voice 
are the phase heads. There is discussion in the literature (Kratzer 1996; Harley 
2013; Legate 2012; Safir & Bassene 2017) concerning whether the phase edge is the 
functional head v, which determines that a root is verbal or a higher Voice head 
(which takes vP as a complement). Following Kratzer and others, we assume that 
Voice selects vP and can introduce the external argument (EA). Alternatively, it 
might be assumed that the EA is introduced in Spec vP and can raise to Spec 
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VoiceP (in parentheses in 24), thus inhabiting the phase edge.? Thus the maxi- 
mal span of locality extends from the edge of the VoiceP downward to the edge 
of the CP (C, Spec CP, and adjunctions to CP, if these are different from Spec CP) 
The Voice phase does not include the higher (bolded) T or anything above it, nor 
anything below AGR-ti (C) such as the lower (bolded) TP. 


(24) [T [voicep (EA) [voice Voice [yp EA v ...[cp ...AGR-ti [Tp ...]]]1]] 


In 83.1 we examine the phase-internal relations between the EA and Voice 
with respect to AGR-ti and in 83.2-3.3 we suggest an analysis based on Voice 
agreement that could instantiate the allomorphy analysis proposed in (18). 

Notice now that matrix T and AGR-ti do not share a phase in (24). Thus we 
would not ordinarily expect any agreement relation to hold between those two 
heads, unless AGR-ti or a phrase below Voice that contains AGR-ti somehow 
escapes to the edge of VoiceP. We consider "escape" strategies in sections $83.3- 
3.5. 


3.1 Agreement internal to the VoiceP 


On the view that a passivized object passes through Spec VoiceP (and perhaps 
through Spec vP, where v selected by passive Voice does not assign an EA), we 
may expect that agreeing C in some languages would be sensitive to whatever 
inhabits the argument position that is most local to the Voice head. In Lubukusu 
(25) (from Diercks 2013: 368, glosses ours), the verb agrees with the passivized 
subject. 


(25) a. (Ese) n-a-bol-el-a Nelsoni n-di ba-keni 
I SM.c1.1*-psT1-say-APPL-fv c1.Nelson sM.c1.1st-that c2-guests 
ba-a-ch-a 


SM.c2-PST1-go-fv 
‘I told Nelson that the guests left’ 

b. Sammy  ka-bol-el-w-a a-li  ba-keni b-ol-a 
cl.Sammy sM.c1-say-APPL-PASS-fv c1-that 2-guests sM.c2-arrive-fv 


‘Sammy was told that the guests arrived. 


*It is possible that VoiceP is always the phase edge, but only occasionally introduces the EA. We 
will assume that only some light verbs are banned from introducing their EA in Spec V, such 
as CAUS, while most other verbs assign their EA in Spec vP. Arguments against assigning EA 
to Spec vP in Pylkkànen (2008) are primarily based on evidence about cAvs. Any argument 
exiting VoiceP will have to pass through its edge, however. See also Safir & Bassene (2017) for 
discussion of the left periphery of the verbal domain. 


176 


9 Complement clause C-agreement beyond subject phi-agreement in Ikalanga 


That such a relationship between Spec VoiceP is possible in both active and 
passive is what would be expected, but this is not what we find in Kinande, nor is 
it what we always find in Ikalanga. In Kinande, another C-agreement language, 
agreeing C (agreeing with matrix subject only) in the active reverts to default 
non-agreeing form in the passive, which suggests that it is primarily sensitive to 
passive voice, not agreement with a DP. Thanks to Prof. Philip Ngessimo Mutaka 
for these Kinande examples. 


(26) a. Yoháni mwasirisyakumbusy’ abakolhw’ ati balwé b'erisom' echapítre 
2 
Yohani mo-a-sirisya-buki-a a-ba-kolho  a-ti ba-lue 
John mo-sM.cl-FUT-remind-fv Iv-c2-student sM.c1-COMP SM.c2-AUX 
ba e-ri-som-a  e-chapitre 2 
C2.LK IV-INF-read-fv 1v-chapter 2 
‘John reminded the students that they should read chapter 2° 


b. abakolhó móbásirisyabukibw' ambu bálwé b'erisom echapítre 2 
a-ba-kolho | mo-ba-sirisya-buk-i-bu-a ambu 
IV-c2-student mo-sM.c2-FUT-remember-CAUS-PASS-fv COMP 
ba-lue ba  e-rissom-a  e-chapitre 2 
SM.C2-AUX C2.LK IV-INF-read-fv 1v-chapter 2 


"The students were reminded that they should read chapter 2° 


However, the shift to ambu has a semantic consequence, namely, it removes the 
speaker's responsibility for the following proposition, and which echoes a similar 
distinction related to non-agreeing voice in Ikalanga perception complements. 
Nonetheless, the shift from the agreeing C to the non-agreeing C suggests that 
it is passive Voice in Kinande that fails to facilitate C-agreement with the matrix 
subject. 


3.2 What we know so far 


The strongest theory of the distribution of kuti, kuyi, AGR-ti and AGR-yi is that 
they are allomorphs (apart from the infinitival passive kuyi), as this imposes cer- 
tain analytic requirements that stipulated distributions do not. By definition, the 
form of the C should be predicted by its syntactic and/or morphological context, 
so it is a rule that determines the allomorph. After all, the AGR-yi/ AGR-ti al- 
ternation is not a predicate specific relation even if the availability of AGR-ti is 
predicate specific and must be stipulated somehow. These, then, are the alterna- 
tions predicted. 
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(27) The allomorphy theory as driven by Voice 
a. AGR-ti alternates with AGR-yi when the superordinate verb is active 
or passive, respectively. 
b. Kuti is almost always available as a default where the superordinate 
verb is active or passive. 


c. Kuyi is possible when the superordinate verb is passive 


We have distinguished two kinds of verbs that take finite CP complements in 
terms of the C alternation that they permit. There is a small class of verbs that 
select AGR-ti and those verbs participate in the AGR-ti/AGR-yi alternation. Just 
about all verbs can take kuti complements. Passivized, these verbs allow for a 
kuyi C when the CP is postverbal. Some verbs take kuyi complements that are 
understood as impersonal infinitival passives, but kuyi is acting as a verb in these 
cases. 

We have also established certain relations that have to be captured in any 
analysis of the Iklalanga phenomena. 


(28) a. C-Agreement for voice is independent of agreement for tense (see 
10b). 
b. C-Agreement for phi-features with active or passive voice is 
independent of tense (see 10b and 11). 


c. C-Agreement for phi-features is independent of matrix agreement on 
T (as in 10b and 11). 

d. C-Agreement for tense is only possible if there is agreement for 
phi-features. 


In what follows, we argue that (28a-28c) are captured by tying C-agreement 
relations to Voice, and we postpone discussion of (28d), which is licensed by a 
different locality relation, until $3.5. 


3.3 Modeling Voice agreement with C 


Up to this point, we have been making the case that the relation between Voice 
and -ti complementizers is one of agreement in a descriptive sense, that is, the 
-ti/yi alternation covaries with passive marking on the verb. Although Diercks 
(2010: 353-370) discusses and rejects modeling Lubukusu C-agreement as agree- 
ment with Voice, we believe modeling C-agreement as a Voice-C relation is the 
best account of Ikalanga and perhaps correct for Lubukusu as well. As Diercks 
points out, a Voice-C relation suggests why direct objects or other intervening 
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nominals do not shift agreement to non-subjects, since Voice only agrees with 
the nominal closest to it, namely, the EA or a passivized nominal that passes 
through Spec-VoiceP (or Spec-vP). Diercks rejects the Voice-C model, however, 
because he assumes that the causative affix (CAUS) takes VoiceP complements, 
and if so, the affected subject, rather than the subject of CAUS, should control 
agreement. Under different assumptions about the position of the EA, Voice-C 
relation may actually produce the right result. If most predicates assign EAs to 
Spec vP, but only certain light verbs like CAUS (a species of v) assign their EA 
in Spec VoiceP (see fn. 8), then CAUS can take a vP complement that has an EA, 
but would not embed a Voice projection below it. The structure is illustrated in 
(29). 


(29) [Voicep EA [ Voice [yp [y CAUS [vp EA [y V... [cp [C... ]] 1] 1] N 


If this approach is viable, then only the causer argument can antecede Voice, so 
only the causer argument can control C-agreement. This is the right prediction 
for Lubukusu, but in Ikalanga (30) the introduction of a causative affix blocks 
C-agreement altogether, perhaps as a form of defective intervention that neither 
the movement theory nor our agreement theory fully predicts. 


(30) ba-isana b-aka-buzw-is-a Neo mme-abe kuti kene 
c2-boys sM.c2-PsT2-ask-cAvs-fv Neo mother-hers that whether 
b-aka-teng-a ma-bisi. 

SM.c2a-PST2-buy-fv c6-melons 
"The boys made Neo ask her mother whether they had bought 
watermelons. 


Thus treating the Voice-C relation as the core of C-agreement has advantages,” 
especially in Ikalanga, where we have morphological evidence that the voice of 
C covaries with the voice of the immediately superordinate verb. 

At this point, more technical issues arise as to how “agreement” is to be 
modeled within a theoretical approach. In minimalist theories, such as Chom- 
sky (2001), agreeing heads with unvalued features are “probes” which search 
for “goals” (typically nominals) in order to value their (probing) features. This 
probing operation is called “Agree”. Theorists divide over whether all Agree is 


?Diercks (2010: 367-369) rejects the Voice-C relation as a model for C-agreement in Lubukusu 
in favor of a control analysis, in part because of examples where C-agreement holds for clausal 
complements to direct object nouns. These cases, also found in Ikalanga, are indeed puzzling, 
but they are so for all accounts, including the proposal of Diercks et al. (2017), discussed in 
83.5. 
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downward-looking, upward looking, or both. They also disagree about whether 
or not Agree is the operation that ensures that anaphors agree with their an- 
tecedents. Following Rooryck & van den Wyngaerd (2011), Diercks et al. (2017) 
assume that anaphoric features of AGR-ti raise to adjoin to the vP (or VoiceP) 
phase head where they can probe down to get the features of the EA. Then these 
agreement features are morphologically realized only in the C position, not the 
higher vP adjoined position. The inheritance of morphologically realized features 
downward to a copy does not follow from Agree or copy theory, however. The 
only overt evidence they offer for C-movement to vP is the behavior of ‘say’ verbs 
in languages like Ikalanga that don’t allow C to follow them. Although we adopt 
a version of their analysis for the Lubukusu ‘say’ verb below, the motivation we 
provide does not justify generalized C-to-vP (or VoiceP) adjunction. 

We agree with Diercks et al. that Voice and C are in an antecedent-anaphor 
relation, but we do not treat the antecedent-anaphor relation as an Agree rela- 
tion. Rather it is a morphological relation that results from anaphoric elements 
that find their antecedent features within the same phase, as proposed in Safir 
(2014). The relation between Voice and the EA may also be seen as anaphoric. 
Thus Voice gets phi-features from the EA or whatever passes through its Spec 
VoiceP, and then an anaphoric voice feature on C is anteceded by Voice along 
with its agreement features that were anaphorically valued by Spec VoiceP. On 
this model, phi-agreement on AGR-ti and AGR-yi only proceeds by virtue of 
agreement with Voice. When Voice is active, the agreeing C is AGR-ti and when 
Voice is passive, we get AGR-yi. Since we could not find a semantic generaliza- 
tion that characterizes the small class of verbs that license agreeing C, selection 
for agreeing C must be treated as a lexical matter. There are various ways to 
stipulate this, but how that is done need not require anything special, so we do 
not address it here. 

Our account predicts that agreement on Voice is independent of agreement 
introduced by finite T. As (11) illustrates, this is the correct prediction, since AGR- 
ti agrees with the subject of an infinitive (presumably PRO) in the absence of 
finite agreement introduced by T. 


As Diercks et al. (2017) point out, the “delayed valuation" of Carstens (2016), which allows a 
probe to search “up” if there is no agreement goal locally c-commanded by the probe, appears 
to predict that nominals intervening between C and the superordinate subject should result 
in agreement with non-subjects. Carstens argues that the intervention does not occur due to 
independent factors, but those factors deserve further scrutiny. Both Diercks et al. and the 
proposal made here do not require additional mechanisms to make the right prediction with 
respect to this intervention. 
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There are few verbs, if any, that take finite complements and are completely 
incompatible with kuti. We propose that the kuti/kuyi alternation arises when 
the anaphoric feature on -ti has no phi-feature, so the anaphoric feature on -ti is 
only sensitive to passive/active. The absence of an anaphoric feature on -ti also 
yields kuti, as in cases where there is a finite sentential subject. The C of the CP 
subject is not anteceded by Voice and so cannot successfully agree. 

Voice sensitivity ascribed to AGR-C may provide insight into what is going 
on in Kinande and Lubukusu. In Lubukusu, C-agreement with the subject is also 
possible with matrix active verb subjects as well as passivized subjects. As in 
Diercks’s (2010) hypothetical account of Voice agreement, we assume that phi- 
agreement is parasitic on Voice in Lubukusu, but unlike Ikalanga, voice has no 
morphological exponent on C. Kinande, like Lubukusu, has no C-morphology 
for voice. In Kinande, however, when the matrix verb is passivized, C-agreement 
is blocked. This would follow if passive Voice in Kinande is not anaphoric for 


phi-features, thus has none to transmit." 


3.4 Raising of -tito v 


Diercks et al. (2017) suggest that the reason ‘say’ verbs in some languages resist 
taking any complementizer is that the ‘say’ C in those languages raises into the 
matrix clause. In their account, it is raising to an adjoined position on vP or 
VoiceP and it is independent of whether or not there is a lexical matrix verb. By 
contrast, we propose to limit the raising of ‘say’ C by permitting it only when 
it fills an empty v position in the matrix clause. Both our account and Diercks 
et al., however, assume that the -ti root begins as a complementizer, not as a verb, 
as ‘say’ complementizers are more commonly speculated to be. In Ikalanga, this 
would be manifested by the -ti root taking on verbal status and being licensed to 
bear verbal morphology, which it cannot do when it remains the head of C. 


(31) [TP T [Voicep EA [Voice [yp [y v [cp -ti [rp ...]] 1 11 ] 


a. 
b. [TP T [voicep EA [Voice-v-ti [yp [y v-ti... [cp -ti [rp ...]] 1 J] ] 


Raising of C to v is illustrated in (31b) (copies bolded), presumably with sub- 
sequent raising of v to Voice, part of the phase edge, where agreement with T 


"A further challenge is raised by C-agreement in Limbum. Nformi (2017) reports that direct 
objects block c-agreement in Limbum, which otherwise looks just like Lubukusu in the relevant 
respects. Accounts designed for the Lubukusu and Ikalanga patterns so far do not account for 
Limbum. Nformi proposes that the direct object blocks Agree between upward-probing C and 
the EA (see also Carstens 2016, on Lubukusu). 
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is possible. This limited account of ‘say’ C-raising is thus morphologically moti- 
vated, but there is an independent reason to suppose that the v corresponding to 
‘say’ might often be morphologically null. 

Grimshaw (2015) proposes that the semantics of ‘say’ verbs follows a strikingly 
regular semantic decomposition based on what she calls the ‘say’ schema. She 
notes that English say is a kind of vanilla report of a particular form of speech 
act that has at least two arguments, a source and a “linguistic material” argu- 
ment that amounts to content of what was said (setting aside the addressee or 
goal argument). One can talk nonsense or speak words, but neither of these two 
speech acts involves a linguistic material argument, as the direct objects have 
no propositional content. She then notes that three classes of verbs are based on 
the ‘say’ schema, namely, ‘say by means’ (mutter, shriek, mumble), ‘say with at- 
titude’ (bitch, gripe) and ‘discourse role’ verbs (ask, announce, comment, remark, 
tell). She points out that the three classes do not overlap, in that there are no 
monomorphemic verbs that mean ‘announce by shrieking’, ‘ask in a bitching 
way’ or ‘mutter by bitching’, etc. She proposes that all three sets of verbs modify 
the same single mutable feature of the ‘say’ schema to create other verbs. The 
‘say’ schema itself is just a template of argument places waiting for suitable mor- 
phology. Our suggestion is that in languages like Ikalanga, the predicate is filled 
by a complementizer from the argument that distinguishes the ‘say’ schema by 
virtue of the presence of a linguistic material argument." 

If this proposal is on the right track, then the suggestion by Diercks et al. that 
C raises to become main verb ‘say’ is essentially correct, but it does not support 
their proposal that movement of C to matrix vP or VoiceP is in any way general. 


3.5 Locality between matrix T and AGR-ti 


The more challenging locality relation for phase theory is the relationship be- 
tween T and the complement clause C, insofar as they are separated by a phase 
boundary. At least two different approaches come to mind that might address 
this puzzle. 

One way is to motivate (covert) movement of AGR-C into the periphery of the 
VoiceP phase in order to be high enough to agree with T. In the past, anaphors 
that can only agree with subjects and never with objects have been posited to 
move to the matrix T where only the Spec-TP locally c-commands them (on 
“hoisting” analyses, see Safir 2013). The motivation for moving nominal anaphors 


This proposal grew out of a discussion with Jane Grimshaw, personal communication. 
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in this way was always thin, driven by the needs of the locality of binding, rather 
than any morphological property of tense apparent on the element that moves.l 

This proposal, essentially that of Diercks et al. (2017), is anaphor movement, 
except what moves has no nominal character. The vestige of T-agreement on C 
is a posited residue of covert head-movement of AGR-ti to the edge of VoiceP, as 
in (32) (bolding=copies). 


(32) [rp T [voice AGR-ti [voicep EA [voice ---[cp AGR-ti [rp ---]] ]] 1] 


The AGR-ti adjoined to VoiceP is an unpronounced higher copy resulting from 
movement. Adjoined to the VoiceP, abstract AGR-ti is then in the VoiceP phase 
edge and local to T. A serious problem for such an account is that Spell-out should 
apply to the lower copy of AGR-ti (the one that is overt) as soon as the VoiceP 
phase is complete. T is outside the VoiceP phase, however, so the lowest C copy 
of the complement clause could only get its tense agreement by a counter-cyclic 
operation of inheritance down into the closed VoiceP phase. This counter-cyclic 
Spell-out breaches the Phase Impenetrability Condition (Chomsky 2001). 

An alternative account would be to assume that the complement clause can 
extrapose to a position at the edge of VoiceP where the edge of the complement 
clause CP phase would be visible to T in the matrix clause CP phase. Notice that 
it is possible for a complement clause to be separated from adjacency to the verb 
by an adverb. 


(33) Néó w-ákà-léb-á chôsèlélè (a)-ka-ti Nchidzi 
Neo sM.cl-PsT2-say-fv indeed  sM.c1-PST2-that Nchidzi 
w-aka-téng-a lori 
SM.c1l-PST2-buy-fv car 
‘Neo said it definitely that Nchidzi had bought a new car: 


If the complement clause can extrapose by adjunction to VoiceP, where C is 
on the edge of CP, and CP is on the edge of VoiceP, then C is in the same phase 
as T. 


(34) [T [VoiceP [VoiceP ..(EA) [Voice M [vp ICE AGR-C [TP]] Jj] [cp AGR-C 
[TP]]] 


PDiercks et al. attempt to motivate movement of anaphoric features to adjoined vP (or VoiceP) 
position on the basis of creating “referential” vPs, which, to avoid crashing, must have no 
meaningful, but unvalued features in them. It is unclear how they can account for cases like 
The men considered themselves to have praised themselves, where the EA of praise has no valued 
features to contribute, so movement of anaphoric features of the direct object to vP is not 
motivated or helpful. Such derivations should crash in their account. 
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The (underlined) overt copy of AGR-C in the extraposed CP at the edge of the 
complement clause domain is no longer separated from T by a phase boundary 
in (34). The underlined AGR-C can thus agree with Voice in the higher phase if 
Voice (and v) is incorporated into T, as is usually assumed in Bantu (but clausal 
subjects would still be outside the c-command domain of T, so would not agree 
in voice, as shown in (23)). Thus T-agreement (from anaphoric tense features 
parasitic on agreeing C) would be direct, not indirect via copy inheritance down, 
as in (32). 

Rackowski & Richards (2005) have proposed extraposition of exactly the sort 
we propose to make the CP phase edge susceptible to wh-extraction, and in par- 
ticular, with accompanying agreement relations established with the extraposed 
clause. We set aside investigation of this parallel (pointed out to us by Michael 
Diercks, personal communication) for future investigation. 


4 Conclusion 


Although our approach holds out the promise of applying more generally to other 
C-agreement systems, empirical studies of matrix C-agreement are still sparse 
and our proposals will have to be tested against the additional patterns that may 
be discovered (including Diercks & Rao 2019, not addressed here). Nonetheless, 
the relations between voice and tense and subordinate C-agreement uncovered 
in Ikalanga will have to be accounted for in any future approach. 


Abbreviations 

& associative plural OM object marker 

AUX auxiliary PASS passive 

c noun class PRN pronoun 

CAUS causative PRS present 

fv final vowel PST1 recent past 

FUT future PST2 remote past 

INF infinitive RCM reciprocal verbal affix 
IV initial vowel RFM reflexive verbal affix 
LOC locative SBJV | subjunctive 

NEG negation SM subject marker 
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Optional past tense in Wolof 
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University of Florida 


In this paper, we discuss the interpretation of the past temporal marker oon in 
Wolof (Niger-Congo; Senegal), in light of recent claims in the literature regarding 
its status as a so-called “discontinuous past.” We show that the cessation inference 
associated with oon is a conversational implicature. Thus, oon can receive an anal- 
ysis as a plain semantic past tense. 


1 Introduction 


There has been some debate in the recent literature regarding the semantic na- 
ture of so-called “discontinuous past” markers. On the one hand, Plungian & van 
der Auwera (2006), to whom the term “discontinuous past” is due, characterize 
its meaning as “past and not present” or “past with no present relevance.” 

On the other hand, Cable (2017) argues that the apparently discontinuous se- 
mantics of the Tlingit (Na Dene, Alaska and British Columbia) decessive form 
(Leer 1991) is actually a defeasible implicature, i.e., not part of the conventional 
semantics of the tense form. Cable assigns a plain past tense semantics, and the 
implicature of “not present” or “no present relevance" is derived via competition 
with temporally unmarked clauses, which can receive either a past or present 
interpretation. Cable further observes that discontinuous pasts are found exclu- 
sively in languages where overt past marking is optional! and thus calls into 


'By “optional” past, we are referring to the fact that past temporal reference can also grammat- 
ically be achieved with temporally unmarked clauses. Speakers may nevertheless make use of 
optional past markers for specific rhetorical purposes, as highlighted by Plungian & van der 
Auwera (2006). 
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question whether the category of discontinuous past exists at all in natural lan- 
guage. 

In this paper, we contribute to this discussion by examining the past tempo- 
ral marker oon? in Wolof (Niger-Congo; Senegal; see Church 1981; Robert 1991). 
This tense marker was identified by Plungian & van der Auwera (2006) as a dis- 
continuous past, and the Wolof data formed an important part of their argument 
for the existence of discontinuous pasts in the world's languages. The main ev- 
idence for this claim comes from a cessation inference associated with the use 
of oon in contrast with temporally unmarked past-referring sentences.? For in- 
stance, comparing (1) and (2), the addition of oon in (2) gives rise to a cessation 
inference that the result state of the event (here, the subject being gone) no longer 
holds at present? 


(1) Dem-na-@  Ndar. 
g0-C-SCL.3SG Saint-Louis 
‘He left for Saint-Louis (and is still there), 
‘Il est parti à Saint-Louis (c'est toujours vrai, il n'est pas là): 
(Robert 1991: 279) 


(2) Dem-oon-na-2 Ndar. 
go-PST-C-SCL.3sG Saint-Louis 
‘He had left for Saint-Louis (and has since come back)’ 
‘Tl était parti à Saint-Louis (et en To, il est revenu): (Robert 1991: 279) 


The use of oon with stative predicates gives rise to the inference that the state 
no longer holds in the present, as illustrated by the translations in (3-4). 


(3) Tiit-na-a. 
afraid-c-scr.1sc 
‘Tam afraid: (Torrence 2012: 25) 


?The past marker surfaces as oon if the preceding element ends in a consonant, and as woon if 
it ends in a vowel, as a result of a phonological hiatus repair. 

?Like Cable, we follow Altshuler & Schwarzschild (2012) in using the terminology of "cessation" 
to describe this inference. 

^In examples taken from other sources, we modify the morpheme glosses according to the analy- 
sis of Martinovié (2015). For examples from Robert 1991, we keep the original French translation 
and add our own colloquial English translation. The translations do not represent our analysis 
of the Wolof forms. 
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(4) Tüt-6on-na-a. 
afraid-PST-C-SCL.1SG 


‘I was afraid (but I am not now): (Torrence 2012: 26) 


The analysis of oon as a discontinuous past contrasts with that of the past 
tense in languages like English, which makes no claim about the state of affairs 
at present. For instance, in (5), we have a discourse about a past time, and the past 
tense is used in each clause. These uses of the past tense in English simply refer to 
the topical past time, and make no claims about the state of affairs at the speech 
time. For instance, the sentence in (5c) only makes a claim about the past topic 
time (the time of looking in the room), and not about the present; intuitively, the 
book is still in Russian at the speech time (if it still exists). 


(5) Context: A judge poses question (a) to a witness, who replies with (b-c): 
a. What did you notice when you looked in the room? 
b. The light was on. There was a book on the table. 


c. It was in Russian. (Klein 1994) 


In this paper, we argue that oon is in fact not a discontinuous past, but rather 
a past marker with a conventional meaning parallel to the past tense in English. 
Using diagnostics similar to those that Cable (2017) used in his study of Tlingit, 
we show that the cessation inference of oon is not part of its conventional mean- 
ing, but rather is a conversational implicature, arising due to competition with 
temporally unmarked clauses (see also Bochnak 2016 on optional past in Washo 
(Hokan/isolate, California and Nevada)). In this respect, Wolof oon is similar to 
other optional past markers in other languages, as has been argued in the recent 
literature. In this respect, we concur with Church (1981), who also showed that 
oon does not always have a “discontinuous” interpretation (though with different 
terminology and analytical tools). 

The paper proceeds as follows. In 82, we discuss the temporal interpretation 
of tenseless clauses in Wolof, while in 83 we turn to the interpretation of oon and 
show that it behaves like an ordinary past tense marker. $4 contains our analysis, 
including a proposal for deriving the cessation implicature associated with oon. 
In 85 we survey some syntactic evidence that suggests that oon does not behave 
syntactically like a tense head. $6 concludes. 

Unless noted otherwise, all the data in the paper were obtained by the sec- 
ond author in Saint-Louis, Senegal, during March 2016 and April-May 2017. All 
speakers were native speakers of Wolof, and Wolof was their first language. The 
data represent judgments of nine speakers, age 30 to 68. We use direct elicitation 
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in order to replicate as closely as possible the data used in previous works on 
so-called discontinuous pasts in other languages (Bochnak 2016; Cable 2017). 


2 Temporal interpretation of tenseless clauses 


Wolof finite indicative clauses have an obligatory complementizer layer (Marti- 
novié 2015). There are several types of complementizers with different syntactic 
and information-structural properties; these differences do not concern us here 
as they do not affect the temporal interpretation. We therefore gloss all comple- 
mentizers as C. 

Tense marking and negation in Wolof are only possible in the presence of 
a complementizer (Njie 1982). Wolof also has minimal clauses (Sauvageot 1965; 
Church 1981; Dialo 1981; Robert 1991; Zribi-Hertz & Diagne 2003), which can be 
used in a narrative context and appear to be smaller than TPs.? The temporal in- 
terpretation of such clauses is determined with respect to a previously introduced 
temporal anchor. In this paper we are therefore only concerned with clauses that 
contain the CP and TP layers. 

In clauses with no overt tense/aspect marking, stative predicates receive a 
present interpretation by default, as in (6-7). 


(6) Baax-na-2. 
good-c-scr.3sG 


‘It is good// *'It was good’ 


(7) Da-ma mer. 
do.c-SCL.1sG angry 


‘Tam angry. / *‘I was angry: 
Meanwhile, eventive predicates receive a default past interpretation, as in (8- 


9). As shown in (10), activities pattern with other eventive predicates, rather than 
states. 


(8) Xale yi lekk-na-ñu  ceeb. 
child the.PL eat-c-sCL.3PL rice 


‘The children ate rice?/ *“The children are eating rice’ 


>Zribi-Hertz & Diagne (2003) consider them to be vPs, but they can contain imperfective aspect, 
which suggests they are at least as big as an AspP. 
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(9) Musaa dem-na-2. 
Moussa leave-c-scr.3sG 


‘Moussa left?/ **Moussa is leaving: 


(10) Musaa fécc-na-2. 
Moussa dance-c-scr.3sG 


‘Moussa danced./ *‘Moussa is dancing: 


However, these defaults are not tied to the aspectual class of the predicate per 
se. Derived statives (e.g., eventive predicates co-occurring with ‘imperfective’ di) 
can also have present temporal reference, as in (11-12). 


(11) Usmaan-a di (»Usmaanay) gis Musaa. 
Oussman-c IMPF see Moussa 


‘It’s Oussman who sees Moussa. 


(12) Daf-a-@ di (>dafay) añ, mén-ul fiéw. 
do-C-SCL.3SG IMPF eat.lunch, can-NEG come 
‘He is eating lunch, he cannot come: 
‘Il est en train de manger, il ne peut pas venir: (Robert 1991: 263) 


To account for these facts, we follow the principles of Smith & Erbaugh (2005); 
Smith et al. (2007) for default temporal interpretation of tenseless clauses. These 
principles were developed to account for temporal interpretation of the tense- 
less language Mandarin (Smith & Erbaugh 2005), and have been applied to other 
tenseless languages, such as Navajo (Smith et al. 2007) and Hausa (Mucha 2013). 
The three main principles - the Deictic Principle, the Simplicity Principle of In- 
terpretation, and the Bounded Event Constraint - are given in (13-15): 


(13) Deictic Principle 
Situations (events) are located with respect to UT 
De, utterance time is the default reference point) 


(14) Simplicity Principle of Interpretation 
Choose the interpretation that requires the least information added or in- 
ferred. 


ĉIn the examples, -y is an allophonic realization of di; see Dunigan 1994, Torrence (2005; 2012), 
Martinovié (2015). 
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Hierarchy of Simplicity: 

a. RT = UT: Present time reference is the simplest kind of temporal ref- 
erence since (i) an utterance event always provides a time interval to 
which an RT variable can be anchored, namely UT; (ii) present inter- 
pretation requires no displacement of either the time or world of eval- 
uation 

b. RT « UT: Past time reference is more complex since it requires the 
displacement of RT from the concrete utterance event 


c. RT » UT: Future time reference involves both RT shifting but also 
modal displacement, and thus increases the level of abstraction 


(ensures that present is preferred over past, which is in turn preferred over 
future) 


(15) Bounded Event Constraint 
Bounded events are not located in the present. Speakers follow a tacit 
convention that communication is instantaneous. The present 
perspective is incompatible with the report of a bounded event, because 
the bounds would go beyond that moment. 
(bounded events cannot be located in the present) 


The Deictic Principle states that the utterance time is the default reference 
point for temporal interpretation. Together with (14a), this predicts a present 
interpretation as a default. However, by (15), bounded events - which cover (per- 
fective) eventive predicates — cannot be located in the present. These are then 
shifted to a past interpretation, given (14b). This setup also predicts that future 
reference with tenseless clauses is dispreferred. In many tenseless languages, ad- 
ditional morphology must be used to achieve future time reference (Matthew- 
son 2006; Tonhauser 2011; Bochnak 2016). This is indeed also the case for Wolof, 
where the imperfective marker di is used for future reference, as in (16). 


(16) Di-na-a toog ceeb-u-jén. 
IMPF-C-SCL.1SG cook rice-GEN-fish 


‘T will cook ceebujén: 


"See Bochnak & Martinović 2018 for discussion and an analysis of imperfective di and its future 


uses. 
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3 The interpretation of oon 


Turning to the semantics of oon, we argue that it is a regular past tense marker. 
The main pieces of evidence for this claim are that clauses with oon are obliga- 
torily interpreted as past, and the cessation inference (i.e., “discontinuous” in- 
terpretation) does not always occur with oon. We also show that oon is not an 
English-style perfect, and that oon is found in counterfactual conditionals. 

First, we find that oon is only compatible with past time reference. In addition 
to the examples we have already seen, we add (17-18) below. 


(17) Baax-oon-na-2. 
good-PST-C-SCL.3SG 


‘It was good:/ *‘It’s good: 


(18) Xale yi lekk-oon-na-fiu ceeb. 
child the.PL eat-PST-C-SCL.3PL rice 


‘The children ate rice./ *“The children are eating rice. 


Second, the cessation inference associated with oon is not always present for 
all speakers.® Recall the data in (1-2), repeated here, which show that the use of 
oon can trigger a cessation inference. 


(19) Dem-na-2  Ndar. 
g0-C-SCL.3SG Saint-Louis 
‘He left for Saint-Louis (and is still there): 
‘Tl est parti à Saint-Louis (c'est toujours vrai, il n'est pas là): 
(Robert 1991: 279) 


(20) Dem-oon-na-2 Ndar. 
go-PST-C-SCL.38sG Saint-Louis 
‘He had left for Saint-Louis (and has since come back)? 
‘Tl était parti à Saint-Louis (et en To, il est revenu): (Robert 1991: 279) 


"There is both interspeaker and intraspeaker variation in this. Some speakers insist on the 
cessation inference in some contexts but not in others, and for some speakers it is never present. 
We have not found any speakers for whom the cessation inference is obligatory in all contexts 
that we tested. 
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However, when a past topic time is overtly specified, e.g., by a time adverbial 
as in (21), there is no cessation implicature.? 


(21 Musaa jénd(-oon)-na-@ oto bu beesat  bijáll, waye 
Moussa buy(-PsT)-C-SCL.38G car C new year c past but 
mu-angi (> mungi) ko di(» koy) dawalba  léegi. 
SCL.3SG-C OCL.3SG IMPF drive until now 


‘Moussa bought a new car last year, but he is still driving it’ 


With predicates such as xaru 'kill oneself', many speakers report that the use 
of oon implies that Moussa is now alive again, or that the suicide attempt was un- 
successful (i.e. a cessation inference is detected). However, this effect is reported 
to go away for some speakers in particular contexts; e.g. if (22) is said as part of 
the story of Moussa's life, or if we are retelling the events of, for example, last 
week. 


(22) Musaa xaru(-woon)-na-S ayubés bi weesu. 
Moussa kill oneself (-pst)-c-scL.3sG week c past 


‘Moussa killed himself last week? 


The example (5) from Klein (1994) is also felicitous in Wolof for most of our 
speakers, as shown in (23). Even though the book presumably still exists and is 
still in Wolof, oon can be used in the answer in (23c). 


(23) Context: A judge poses question (a) to a witness, who replies with (b-c): 

a. Lan nga gis bi nga  xoole neeg bi? 
what c.scr.2sc see when scr.2sc look.at-ANT room the.sc 
"What did you see when you looked at the room?' 

b. Làmp bi tàkk-oon-na-2. Am-oon-na-@ benn téeré 
lamp the.sc be.alight-PsT-C-scr.3sc. have-PST-C-SCL.3sG one book 
bu ubbeeku si kaw taabal bi. 

c be.open ontop table the.sc 
"The light was on. There was an open book on the table: 

c. Téeré wolof la-@ (woon). 

book Wolof C-scr.3sc (PST). 


‘It was/is in Wolof? 


?We place oon in brackets to indicate that the sentences with and without oon are accepted by 
speakers in the context provided. 
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Since the cessation inference is not always present, we conclude that it is not 
part of the lexical semantics of oon. Therefore, we do not consider it a “discon- 
tinuous past” in the sense of Plungian & van der Auwera (2006), since is not part 
of its conventional meaning that a state of affairs fails to hold at speech time. 

Third, we observe that oon does not behave like an English-style perfect.!? 
The English perfect does not co-occur with temporal frame adverbials (Klein 
1992), see (24). However, we have already seen in (21) that oon can co-occur with 
temporal adverbials. 


(24) *Ihave bought a car yesterday/last year/on December 1, 2010. 


The English perfect also displays so-called lifetime effects (McCawley 1971). 
The sentence in (25) is apparently infelicitous because Christopher Columbus 
is no longer living. However, as shown in (26), Wolof oon does not have this 


property. 


(25) *Christopher Columbus has discovered America. 


(26) Colombo féefial(-oon)-na-@ Amerik. 
Columbus find-(PsT)-C-scr.3sc America 


*Columbus found America: 


Another property of the English perfect (and of so-called terminative aspects 
more generally, see Bohnemeyer 2002), is that they assert that the result state 
of the perfect-marked event still holds. This means continuations like in (27) are 
infelicitous. These types of examples are nevertheless felicitous in Wolof with 
oon, as shown in (28). 


(27) Ihave lost my glasses, *and now I (have) found them. 


(28) Sama lunettes réer(-oon)-na-@-ma, waye 
poss.1sG glasses lose(-PsT)-C-sCL.3sG-OCL.1sG but 
gis(-oon)-na-a-ko. 
see(-PST)-C-SCL.1SG-OCL.3SG 


‘I lost my glasses, but I found them: 


We acknowledge that perfects in many languages do not have these properties. 
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Fourth, we find oon in counterfactual conditionals. Past tense marking is com- 
mon cross-linguistically in counterfactual conditionals (Iatridou 2000; Halpert & 
Karawani 2012), including in English. The sentence in (29) has a present topic 
time (by the presence of right now), but has past morphology in the antecedent. 
We also see this in Wolof, where oon appears in counterfactual conditionals, as 
in Gol 


(29) IfIwas in Paris right now, I would be eating a croissant. 


(30) Su-ma  ragal-oon rabi, di-na-a tiit léegi. 
if-scL.1sG be.afraid.of-PsT spirit, IMPF-C-SCL.1SG be.afraid now 


‘If I was afraid of spirits, I would be afraid now: 


Although the role of the past tense in counterfactuals is a matter still very 
much under debate, it is certainly striking that Wolof uses this marker in coun- 
terfactuals, just like in many other typologically unrelated languages. 

In sum, apart from its optionality, oon behaves like a regular past tense, where 
the apparent discontinuous semantics are not part of its conventional meaning. 
We therefore propose to analyze it semantically as a regular past tense, just like 
other optional pasts in Washo (Bochnak 2016) and Tlingit (Cable 2017). 


4 Analysis 


We define tense as a morpheme whose conventional meaning relates a reference 
or topic time with the speech time, or possibly another evaluation time (Reichen- 
bach 1947; Klein 1994). We assume a pronominal or referential theory of tense, 
whereby the reference time of a clause is represented as a temporal pronoun 
located in the T head (e.g., Abusch 1997; Heim 1994; Partee 1973, among many 
others). Like other pronouns, it bears an index, and receives its value from an 
assignment function g. Every finite clause contains a reference time pronoun, 
regardless of whether there is an overt tense morpheme or not. We treat oon as 
a tense feature which modifies the temporal pronoun, placing a presupposition 
on its possible values (i.e., restricting it to times in the past of the speech time). 


"We do not intend this point as an argument against a discontinuous past analysis, but rather 
as evidence that oon behaves quite similarly to non-discontinuous pasts in more familiar 
languages. 
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We propose that a sentence such as (31) has the clause structure given in Fig- 
ure 1? Semantically, AspP denotes a predicate of times as in (32a), where we 
assume arguments of the verb are interpreted in their base position. The tempo- 
ral argument slot is filled in by the reference time pronoun. When oon, defined 
in (32c), surfaces, it adds the presupposition that the reference time is located in 
the past of the speech time tọ. (In the absence of a morphological tense, we as- 
sume the value of the reference time pronoun in T is restricted by the principles 
outlined in §2.) The result is a proposition meaning, given in (32d). 


(31) Colombo daf-a-2 féñaal-oon Amerik. 
Columbus do-c-scr.3sc discover-PsT America 


“Columbus DISCOVERED America’ 


(32) a. [AspP ]&* = AtAw.discover(a)(c) at t in w 


b. [T1 ]8* = g0) 
c. | oon ]8** = At.t ; defined only if t < t, 
d. | TP ]&*^ = Aw.discover(a)(c) at g(1) in w ; defined only if g(1) < te 


Under this analysis, cessation is not part of the lexical semantics of oon, contra 
Plungian & van der Auwera (2006). Instead, oon only adds a plain past presuppo- 
sition, just like the past tense in English. The question, then, is how to account 
for the robust intuition, both by native speakers and previous authors, that the 
use of oon in many contexts generates a cessation inference. 

We suggest that the cessation inference is a conversational implicature derived 
by the Gricean Maxim of Manner (cf. Altshuler & Schwarzschild 2012; Cable 2017, 
for whom cessation inferences are analyzed as scalar implicatures). Following 
Levinson (2000), a marked message indicates a marked situation. We assume 


The example (31) is of a predicate focus sentence with do-support in C. We choose this clause 
type for illustration as the verb here stays low, unlike in some other cases when it raises to C 
taking oon with it. The clause structure in Figure 1 is somewhat simplified from what Marti- 
novié (2015) assumes; any differences are not relevant for our analysis here. For example, the 
non-pronominal subject in these clauses is in the left periphery (Spec,CP), and it is doubled 
by a subject clitic which is here represented in Spec;TP (the clitics all move to adjoin above 
TP at a late stage in the syntax). The details of the doubling are not relevant here; we assume 
that the non-pronominal argument is generated in the subject position in Spec,vP (omitted for 
simplicity). Additionally, the verb also raises through the Asp head and carries it on to T, but 
we also omit this here. 

PRobert (1991) analyzes oon as a relative past, in which case the reference time can be related to 
an evaluation time other than the speech time, i.e., t, in (32c) can be distinct from the speech 
time. 
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CP 
DP C 
Colombo ds s 
C TP 
V C SCL T 
daf a D EE 
T AspP 
V T Asp VP 


oon Tj ty DP 
Amerik 


Figure 1: Wolof clause structure 


that a past-referring clause containing oon is *marked" in comparison to a past- 
referring tenseless clause. A Gricean chain of reasoning proceeds as follows. A 
sentence with oon is morphologically more marked than a sentence without oon. 
Given that past marking is not required by the grammar of Wolof, the speaker 
(in most situations) could have used an unmarked form for past temporal refer- 
ence. Since the speaker used a marked form, the hearer infers that the speaker 
must believe that the situation is marked. That is, more than just a plain past 
meaning is intended by the speaker. The hearer infers that the state of affairs 
described does not hold at present, otherwise the simpler form could have been 
used. Therefore, cessation is an inference derived from the fact that oon is op- 
tional, and temporally unmarked clauses can also have past interpretations. 

If the implicature calculation is based on Manner, whereby a marked message 
leads to pragmatic enrichment, the question arises as to why a cessation impli- 
cature in particular is calculated. Why could some other inference not be calcu- 
lated? Apparently, other inferences are in fact possible and attested. For instance, 


HA reviewer points out the possibility that Gricean conversational maxims may not be followed 
in all cultures. At this point, we have no reason to believe they do not apply in Wolof, but leave 
further investigation into this question for later work. 


198 


10 Optional past tense in Wolof 


Church (1981) claims that the use of oon often gives rise to a remoteness inference 
as well. Our speakers also seem to prefer oon when talking about a time in a 
more distant past. Some speakers even report they must use oon in those cases 
(e.g. when speaking about an event that occurred last year). This inference would 
also be a conversational implicature, and not part of the lexical semantics of oon, 
given examples like (33), where oon is possible with demb ‘yesterday’ (assuming 
one day ago does not count as distant"). 


(33) Musaa jénd(-oon)-na-@ oto bu bees démb 
Moussa buy-PST-C-SCL.38G car C new yesterday. 


‘Moussa bought a new car yesterday: 


5 Is oon a tense head? 


There is some indication that oon is syntactically not a head. First, in clauses in 
which the verb raises to C, oon is affixed onto it in affirmative cases, as in (34) but 
skipped over in the presence of negation, shown in (35). Martinovié (2015; 2016) 
proposes an analysis of the affixation of oon in which she argues that oon affixes 
onto the verb postsyntactically (at PF) in a certain syntactic configuration, but 
not in others. Crucially, for her analysis to go through, oon cannot be a head, as 
it would then always be picked up by head movement. She therefore suggests 
that oon is a phrasal morpheme in Spec,TP. 


(34) Xale yi lekk-oon-na-ñu jén. 
child the.Pr eat-PsT-C-sCL.3Pr fish 
"The children ate fish? 


(35) Xale yi lekk-u(D-Z-üu woon jën. 
child the.PL eat-NEG-C-SCL.3PL PST fish 
‘The children didn’t eat fish’ 


The second piece of evidence that casts doubt on the treatment of oon as a T 
head is that it can occur apparently affixed onto non-verbal elements, as reported 
by Torrence (2012). 


(36) Kan-ati-woon la-ñu dóor? 
who-again-PST C-SCL.3PL hit 
"Who did they hit again?' (Torrence 2012: 24) 
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Given the examples above, it is possible that oon is not a tense morpheme, 
but a particular type of a temporal adverbial (cf. Tonhauser 2006 on an optional 
past adverbial in Paraguayan Guaraní). We leave this only as a speculation at 
this point, as the matter requires further research. Given that our definition of 
tense in $4 only makes reference to semantic notions, the question of whether 
oon behaves syntactically like a T head is orthogonal to the core aspects of our 
semantic analysis, where oon is treated as a tense (i.e., relating a reference time 
to speech time). 


6 Conclusion 


We have argued that oon in Wolof marks past temporal reference, and can be 
given an analysis of a semantic past tense. The cessation inference detected by 
Church (1981) and Robert (1991), and analyzed as discontinuous past by Plungian 
& van der Auwera (2006), is not present in all uses of oon, and we argue this is 
a conversational implicature. Given that the same conclusion was reached for 
the optional tense languages Washo (Bochnak 2016) and Tlingit (Cable 2017), the 
status of discontinuous past as a grammatical category is therefore called into 
question. 
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ANT anterior PL plural 

C complementizer POSS  possessive 
GEN genitive REL relative 

IMPF imperfective SCL subject clitic 
NEG negation SG singular 


oc. object clitic 
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Chapter 11 


The Syntax of experiencers in 
Sereer-Siin 


Khady Tamba 
Université Cheikh Anta Diop 


The special grammatical status of experiencers has been at the center of various 
studies within linguistics for several decades (Belletti & Rizzi 1988; Ameka 1990; 
Pesetsky 1995). This attention is due to the fact that there are specific syntactic fea- 
tures that are only associated with experiencers (Landau 2010). This study investi- 
gates object experiencers in Sereer-Siin - a dialect of Sereer, a West Atlantic lan- 
guage of the Niger Congo family spoken in Senegal. I show that object experiencers 
in Sereer can be classified in two groups according to their syntactic behavior. In 
this study I provide evidence that Sereer experiencers can be used to extend Belletti 
and Rizzi's traditional classification of experiencers. Using data from Italian, Bel- 
letti and Rizzi have classified object experiencers in two groups. The first one they 
refer to as class II, has a nominative theme and an accusative experiencer whereas 
the second one, class III, has a nominative theme and a dative experiencer. In Sereer, 
like in Italian, different properties of object experiencers account for asymmetries 
noted with respect to constructions like passive, antipassive and nominalization. 


1 Introduction 


11 Language Background 


Sereer-Siin (Sereer) is a West Atlantic language of the Niger Congo family spo- 
ken in Senegal. It is genetically related to Wolof and Pulaar (Simons & Fennig 
2017). There are various dialects of Sereer. This variety, considered the standard, 
is spoken in the areas traditionally known as Siin and Saalum. The basic word 
order of Sereer is SVO. Sereer is an agglutinative language; this is exemplified in 


(1). 
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(1) a o njaaconge bog -u -ir -k  -at-ir -an 
CL child CLpgp wash -REF -INST -IPFV -IT -NEG -30BJ 
"The child will not wash him/herself with this again' 
b. xa caac axe bog -u -ir -k  -at-ir -an 
CLp, child C154, ,,, wash -REF -INST -IPFV -IT -NEG -OBJ 


"The children will not wash him/herself with this again' 


There are various studies dealing with the noun class system in Sereer (Fal 
1980; Faye 1979; Faye 2013; Renaudier 2012); however there is no agreement about 
the number of noun classes found in Sereer. For instance, Faye (2013) argues 
that Sereer has nine noun classes, whereas for Fal (1980) and McLaughlin (1992), 
Sereer has thirteen noun classes. Table 1 from McLaughlin (1992: 284) gives a list 
of the different noun classes found in Sereer. 


Table 1: Sereer noun classes 


Class Prefix Clitic determiner Class content 


1 o- oxe human singular 
ø- we human plural 

3a a- ale singular 

3b a- ale augmentative singular 

4 a- ake plural 

5 Ø - le singular 

6 ø- ne singular 

7 Ø - fee singular 

8 fo- ole plural 

9 ø- ke plural 

10 o- ole singular 

11 xa- axe plural 


Indefiniteness is shown with the presence of a noun class marker (prefix in 
Table 1) before the noun whereas definiteness is shown through the presence of 
a prenominal and a postnominal class marker (clitic determiner in Table 1). 

Consonant mutation is another important characteristic of Sereer. It is used 
to show subject-verb agreement with regard to number; it is also used for in- 
flectional and derivational processes (see McLaughlin for a detailed analysis on 
consonant mutation in Sereer). Finally, in Sereer, object pronouns are suffixes 
incorporated to the noun (Baier 2018). 
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1.2 Experiencers in Sereer 


There are different types of experiencers. An example of subject experiencer is 
shown in (2a) with the verb bug ‘love’. Object experiencers are shown in (2b-d) 
with the verbs diidlat ‘be scared’, bet ‘surprise’ and soob ‘miss’ respectively. 


(2) a. 


Subject Experiencer 

Faatua bug a  Maamkoor 

Faatu 3sc love -PERF Maamkoor 

‘Faatu loves Mamkoor’ 

Object Experiencer 

Awaa  diid -lat -a o fes ole 
Awa 35G be.scared -CAUS PERF CL young.man CLpgr 
‘Awa frightened the young man’ 

Object Experiencer 

Faatua bet -a 0 njaac onge 

Faatu 3SG surprise -PERF CL child Core 

‘Faatu surprised the child’ 

Object Experiencer 

o fes ole a sooba o tew oxe 
CL young.man CLyrr 3SG miss PERF CL Woman CLypp 


"The woman misses the young man’ 


In (2b) the psych verb appears with a causative suffix whereas in (2c) and (2d) 
the verb does not bear any extra morphology. The rest of this paper is on object 
experiencers (2b-d). 


2 Previous Studies 


In their seminal work on experiencers, Belletti & Rizzi (1988) use Italian to posit 
three classes of experiencers. They are illustrated in (3) from Belletti & Rizzi (1988: 


291-292). 


(3) Italian (Indo-European; Belletti & Rizzi 1988) 


a. 


Gianni teme questo 
‘Gianni fears this’ 
Questo preoccupa Gianni 


"This worries Gianni’ 


205 


Khady Tamba 


c. A Gianni piace questo 
‘Gianni pleases this’ 
d. Questo piace a Gianni 


‘This pleases to Gianni’ 


Belletti & Rizzi (B & R) make the claim that (3a) and (3b) are syntactically sim- 
ilar as they can both be related to transitive structures. Thus verbs like temere in 
(3a) belong to a class of experiencer verbs which have an experiencer as a subject 
and a theme as an object. In contrast, in (3b) with preoccupare type verbs, the ex- 
periencer is in object position whereas the theme is in subject position. Finally, 
in (3c) with experiencer verbs of the piacere class, the verb appears with a dative 
experiencer as a subject and a nominative theme, however these arguments of 
the verb can appear in a different order as shown in (3d). 

In his study on the syntax of experiencers, Pesetsky (1995) puts aside the ex- 
pression "theme" in B & R and argues for the following: 


e Subject argument with object experiencers are always Causer. 


* Object experiencers always have one of these roles: Target of Emotion or 
Subject Matter of Emotion. 


Landau (2010: 3) builds on B & R and uses the three classes of experiencers to 
classify English experiencers. 


* Class I: Nominative experiencer, accusative theme. 
John loves Mary. 


e Class II: Nominative theme, accusative experiencer. 
The show amused Bill. 


e Class III: Nominative theme, dative experiencer. 
The idea appealed to Julie. 


Landau argues that experiencers are mental locations (locatives) and that uni- 
versally "object experiencers behave like oblique arguments, whether their gov- 
erning preposition is overt or not" Landau (2010: 127). He further makes the claim 
that in some languages object experiencers are overtly quirky and for that reason 
can occur in subject position. 

Throughout this paper I use B & R’s classification of Italian experiencers and 
adapt it to classify Sereer experiencers. 
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2.1 Classifying Sereer experiencers using B & R 


In this section, I show that Sereer experiencer verbs come in three classes (adap- 
ted from B & R). Verbs in the first class come with a nominative experiencer and 
an accusative theme whereas the verbs in the second class appear with a nomi- 
native causer, accusative experiencer. Finally, the last class of experiencer verbs 
have a nominative theme and a dative experiencer. These different classes of 
experiencers are illustrated next. 


2.1.1 Class I: Nominative experiencer, accusative theme 


In this class, verbs like bug ‘love’ and and ‘know’ are found, they have the struc- 
ture of a regular transitive verb. 


(4) a. Faatua bug -a  Maamkoor 
Faatu 3sc love -PERF Maamkoor 


*Faatu loves Mamkoor’ 
b. o tew oxe a and a Maamkoor 
CL lady CLygr 356 know PERF Maamkoor 


"The lady knows Mamkoor' 


In (4a) and (4b) the entity undergoing a psychological experience is in subject 
position. The next two classes describe object experiencers. 


2.1.2 Class II: Nominative causer, accusative experiencer 


These experiencer verbs come in two types. The first type must appear with a 
causative suffix whereas the second type does not occur with a causative suffix. 
Note that this is different from B & R's Class II. Even though they do not use 
"causer", my assumption is that a causer role is added to the subject of experiencer 
verbs of Class II. In (5a) and (5b) the verbs diidlat ‘frighten’ and jaaxdat ‘worry’ 
are used transitively and are morphological complex. 


(5) a. Awaa  diid -at -a o fes ole 
Awa 38G be.scared -CAUS PERF CL young.man CLypp 
‘Awa frightened the young man’ 


b. Faatua  jaax -dat -a o tew oxe 
Faatu 3sc be.worried -CAUS PERF CL lady CLpgr 


‘Faatu made the woman worried’ 
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In (5a) and (5b), the experiencers are in object position. The other type of object 
experiencer in this class is shown in (6). 


(6) a. Faatua bet a o njaac onge 
Faatu 3sG surprise PERF CL child Cry, 


‘Faatu suprised the child’ 


b. Faatua weg a o tew oxe 
Faatu 3sG be.unlucky PERF CL lady Cor 


‘Faatu brought bad luck to the woman’ 


In (6a) and (6b) there is no overt causative suffix on the verb. My assumption is 
that there is a silent causative suffix. As will be show in §3 these verbs behave the 
same in some syntactic environments. Object experiencers, according to Pesetsky 
(1995), add an additional causer argument. In addition, the argument related to 
the “cause” must be realized as the subject (Grimshaw 1990). 

Verbs of this class (Class II) behave like regular transitive verbs projecting a 
light verb headed by an overt or a silent causative. 


2.1.3 Class III: Nominative theme, dative experiencer 


Arguments of verbs of this class, shown in (7), appear in the same order as the 
ones of verbs of Class II described (6). 


(7) a. o fes ole a sooba o tew oxe 
CL young.man CLpg; 35G miss PERF CL woman CLopr 


'the woman misses the young man' 


b. Awaa fel a o njaac onge 
Awa 3sG appeal PERF CL boy ` Coen 


'Awa appeals to the boy' 


In (7a) and (7b) the verbs do not have a causative component associated with 
them. These verbs have been argued to be stative/unaccusative. My assumption 
is that the object experincers of these verbs are introduced by a silent preposition 
making them oblique. As will be shown, these objects behave like typical datives. 
The different classes of experiencers in Sereer are summarized in Table 2. 

The remainder of this paper focuses on Class II and Class III experiencers by 
discussing syntactic differences between the two types of object experiencers 
that account for the classification in Table 2. More specifically, I show that they 
behave differently with respect to passivization, nominalization and antipas- 
sivization. 
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Table 2: Sereer experiencers 


ClassI Nominative experiencer Accusative theme 
Class II Nominative causer Accusative experiencer 
Class II Nominative theme Dative experiencer 


3 Distinguishing between the two object experiencers in 
Sereer 


3.1 Passivization test 


In Sereer, passive is shown through the promoting of the verb internal argument 
to subject position whereas the external argument is demoted through suppres- 
sion. Passive construction is marked through the use of the suffix -el! on the 
infinitive verb (Faye 1979; Renaudier 2012; Faye 2013), however, this suffix has 
various allomorphs conditioned by aspect, tense and/or negation. 


(8 a. Awaa fiaama maalo fe 


Awa 3SG eat PERF rice CLpgr 


‘Awa ate the rice’ 
b. maalo fe a  fiaam-e? (*Awa) 
rice CLpgr 38G eat-PAss Awa 


‘the rice was eaten’ 


As seen in (8), with a regular transitive verb, the demoted external argument 
Awa cannot appear in passive constructions. Passivizing experiencer verbs yields 
different results according to the nature of the object. 

Sereer Class II object experiencers can successfully undergo passivization. 
This is illustrated in (9) and (10) with the verbs diidlat ‘frighten’ and bet 'sur- 
prise’. 

(9) a. Awaa diid -lat a o fes ole 
Awa 35G be.scared -CAUS PERF CL young.man CLpgr 


‘Awa frightened the young man’ 


b. o fes ole a  diid -lat  -e? 
CL young.man CLprr 3SG be.scared -CAUS -PASS 


"The young man was frightened’ 


'This suffix is referred to in Faye & Mous (2006) as anticausative. 
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In (9b) derived from (9a) the object experiencer is promoted to subject position 
and the verb, which is morphologically complex, must appear with a passive mor- 
pheme. A similar situation can be observed in (10a-b) with the verb bet ‘surprise’ 
which is morphological simple. Note however that I mentioned earlier that this 


verb is semantically similar to diidlat ‘frighten’ as they both have the “cause” 
component. 


(10 a. Faatua bet -a o njaac onge 
Faatu 3sG surprise PERF CL child CLpgr 
‘Faatu surprised the child’ 
b. o njaaconqea bet -e? 
CL child cL —3sG surprise -PASS 


‘the child was surprised’ 


In (10) the verb behaves as expected since it allows passivization. The object 


experiencer can move to subject position along with a demotion of the original 
subject through suppression. 


Next I show that Class III object experiencers cannot undergo passivization. 
This is illustrated in (11) with the verb soob ‘miss’. 


(11) a o fes ole a sooba o tew oxe 
CL young.man CLp,, 3SG miss PERF CL woman CLogp 
‘the woman misses the young man’ 
b. *o tew oxe a  soob-e? 
CL woman CLpgr 3SG miss -PASS 


"The woman was missed' (intended) 


(11b) shows that passive morphology is incompatible with verbs of this class, 
that is the object experiencers cannot be promoted to subject position. This is 
evidence that they are different from the ones in Class II. (12) follows a similar 
pattern with the verb fel ‘appeal to’. 


(12) a  Awaa fel -a o njaac onge 
Awa 35G appeal PERF CL child cry, 
“Awa appeals to the boy’ 
b. *o njaac onge -a fel -e? 


CL chile CL,gr 3sG appeal.to -PAss 
"The child was appealed to' (intended) 


After this passivization test, I use another test which consists of nominalizing 
the clause containing an object experiencer. 
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3.2 Nominalization test 


Grimshaw (1990) argues that nominalization and passivization are related in that 
in both cases the external argument is optional and as such, can be suppressed. 
Indeed in English, for instance, the external argument in such constructions is 
optional as (13) shows. 


(13) a. The door was opened (by John) 
b. The opening of the door (by John) 


These sentences show that in English the external argument can be suppressed 
in nominalization and passivization. In Sereer, a similar situation can be ob- 
served, however the suppression of the external argument in these constructions 
is mandatory as mentioned earlier with passsives. If these two constructions (i.e. 
passivization and nominalization) are related, one should expect to see results 
similar to the ones observed with the passivization test. 

In Sereer, Class II experiencer verbs can successfully undergo nominalization 
as (14) shows. 


(14) a. Awaa diid -lat -a o fes ole 
Awa 38G be.scared -CAUS PERF CL young.man CLpgr 


‘Awa frightened the young man’ 


b. Nominalization 
o diid -lat ole noo fes ole 
CL be.scared -CAUS CLpgp P CL young.man CLpgr 


‘The frightening of the young man’ 


In (14b) the nominal derived from (14a) appears with noun class markers. In 
addition, the internal argument of the verb, the object experiencer, is introduced 
by the preposition no. A similar pattern can be observed in (15) with the verb bet 
‘surprise’. 


(15) a. Faatua bet -a o njaac onge 
Faatu 3sG surprise PERF CL child Cry, 


‘Faatu surprised the child’ 


b. Nominalization 
o bet ole noo njaac onge 
CL surprise CLpg; P CL child cry, 


‘the surprising of the child’ 
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Just like in the previous example, in (15) the object experiencer appears in a 


prepositional phrase whereas the nominalized verb occur with nominalizers (i.e. 
noun class markers). 


Class III experiencer verbs fail to undergo nominalization. This is illustrated 
in (16). 


(169 a Awaa soob-a o tew oxe 
Awa 3sG miss PERF CL woman CLpg, 
‘Awa misses the woman’ 
b. "o soobole nono tew oxe 
CL miss CLpgp P CL woman CLorr 


"The woman being missed' (intended) 


In (16b), derived from (16a), nominalizing the verb results in ungrammatical- 
ity. This is expected since the verb does not assign accusative case to the object 
experiencer. A similar situation is can be noted in (17). 


(17 2. Awaa fel -a o tew oxe 
Awa 3SG appeal.to PERF CL woman CLpgr 
‘Awa appeals to the woman’ 
b. *o fel ole nonotew oxe 
CL appeal.to CLpgp P CL woman CLpgr 


"The woman being appealed to' (intended) 


In (17) the verb fel ‘appeal to’ cannot be nominalized as the ungrammaticality 
of (17b) shows. Surprisingly, if the passive morphology -el appears with a verb 
of this type (Class III) nominalization is possible as shown in (18). 


(18 a Awaa fel a o njaac onqe 
Awa 3SG appeal PERF CL boy CLpgp 
‘Awa appeals to the boy’ 
b o pel -el ole noo njaac onge 


CL appeal -PASS CLpgp P CL boy CLper 
"The appealing to the young man' (intended) 


c. *o pel ole noo njaac onge 
CL appeal CLpgp P CLboy CLpger 


‘The young man being appealed to’ 
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Faye (2013) argues that another use of the passive marker -el is to derive nom- 
inals from stative verbs. The behavior of Class III experiencer verbs with respect 
to nominalization shows that these verbs are different from the ones of Class II. 

These types of experiencer verbs are superficially transitive but underlyingly 
unaccusative (Belletti & Rizzi 1988; Pesetsky 1995; Landau 2010). 

In the next subsection, I use antipassivation to further distinguish between the 
two types of object experiencers. 


3.3 Antipassivization test 


The term “antipassive” is generally used to refer to a characteristic of voice in 
ergative languages (Crystal 2008). In antipassive constructions, the verb is se- 
mantically transitive but does not project a direct object (Polinsky 2017). 

Polinsky further argues that crosslinguistically, antipassive can be diagnosed 
through case marking, noun incorporation, agreement, word order, verbal affix- 
ation. She also provides evidence that antipassive can be found in accusative 
languages (see also Heaton 2017). 

In Sereer (Renaudier 2011), and related languages like Wolof (Creissels & Nou- 
guier-Voisin 2008) antipassive is marked through verbal suffixation. 


(19) a. o boxole a nat -a o njaac onge 
CL dog CLprr 3SG bite PERF CL boy CL pg 


"The dog bit the boy’ 

b. o PBoxole kaa pat -a -a  (‘o njaac onde) 
CL dog CLpgr IPFV.38G bite -ANTIP -PERF 
‘The dog bit’ 


In Sereer, the suffix -a is used on the verb to mark the antipassive construc- 
tion. The antipassive morpheme is very productive (not only related to verbs of 
transfer and ditransitives as argued in Renaudier). 

With respect to experiencer verbs, different results are noticed according to 
the type of object being dealt with. Verbs belonging to Class II can undergo an- 
tipassivization, that is they can appear with the passive marker along with a 
suppression of the object experiencer. This is shown in (20) and in (21). 


(20) a Awaa diid “lat a o fes ole 
Awa 35G be.scared -CAUS PERF CL young.man CLogr 


‘Awa frightened the young man’ 
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b. Awa kaa diid -lat -a -a 
Awa IPFV.3SG be scared -CAUS -ANTIP - PERF 


‘Awa frightened’ 


(21) a. Faatua bet -a o njaac onge 
Faatu 3sG surprise PERF CL child Cry, 


‘Faatu surprised the child’ 


b. Faatu kaa bet -a -a 
Faatu IPFV.3SG surprise -ANTIP PERF 


‘Faatu surprised’ 


These examples show that experiencer verbs of Class II behave like regular 
transitive verbs in that they can undergo antipassivization. In both (20b) and in 
(21b) the object experiencer argument is suppressed. 

Contrary to Class II verbs, verbs of Class III cannot occur with the antipassive 
marker -a. 


(22) a. Awaa fel -a o tew oxe 
Awa 3SG appeal PERF CL woman CL,gr 


‘Awa appeals to the woman’ 


b. * Awa kaa fel -a -a 
Awa IPFV.3SG appeal -ANTIP -PERF 


‘Awa appealed to’ (intended) 


In (22b) the verb fel ‘appeal to’ appear with the antipassive marker and this 
yields ungrammaticality. The same situation can be observed in (23b). 


(23) a. o njaac onge a sooba o tew oxe 
CL boy CLpgr 3SG miss PERF CL woman CLygr 


‘The woman misses the boy’ 


b. *onjaac onge kaa soob -a 
o njaac CLpg, IPFV.3SG miss -antip 


In this section I have used various tests (i.e. passivization, nominalization and 
antipassivization) to substantiate the claim that Sereer object experiencers come 
into two classes, Class II and Class III. 
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4 Conclusion 


The main aim of this study was to describe object experiencers in Sereer in light 
of Belletti & Rizzi (1988). I have shown that they come in two types, Class II 
and Class III. Contrary to Landau (2010), I have shown that Class II object ex- 
periencers are not oblique and behave like regular transitive verbs. In contrast, 
Class III object experiencers are oblique and as such do not display typical ob- 
ject properties. It is my assumption that these objects are introduced by a silent 
preposition. This is in line with B & R's analysis of experiencers of this type as 
being assigned an inherent dative case. This study is not only a contribution to 
the literature of experiencers but is also a contribution to the study of argument 
structure in Sereer. Table 3, repeated from above, summarizes the different prop- 
erties of the experiencer verbs found in Sereer. 


Table 3: Sereer experiencers 


ClassI | Nominative experiencer Accusative theme 


Class I Nominative causer Accusative experiencer 
Class II Nominative theme Dative experiencer 
Abbreviations 
ANTIP  antipassive NEG negation 
CAUS causative PASS passive marker 
CL noun class marker P preposition 
Clings definite PERF perfective 
FOC focus REF reflexive 
INST instrument 308J third person object pronoun 
IPFV imperfective 3SG third person singular 
IT iterative 
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In this paper, we discuss the realization of focus in Limbum (Grassfields Bantu, 
Cameroon), a language which shows a so-far unattested pattern of focus mark- 
ing, where two distinct focus constructions are realized by two different particles, 
a and bá, which express information focus on the one hand and contrastive fo- 
cus on the other. Strikingly, the former is realized by a structurally more complex 
construction (particle + fronting) - the inverse pattern of what is attested cross- 
linguistically (Fiedler et al. 2010; Skopeteas & Fanselow 2009). A biclausal cleft 
structure underlying the á strategy can be argued to be implausible. Instead, we 
adopt a Q/F particle analysis (Cable 2010) which proposes the existence of a par- 
ticle independent of a higher functional head mediating between that head and 
the focused phrase. Limbum provides overt evidence for both, the head and the 
particle. 


1 Introduction 


The present paper discusses two focus strategies in Limbum (Grassfields Bantu, 
Cameroon) that can be distinguished on the basis of different focus markers and 
the types of focus they convey. The constructions and their respective focus 
markers are shown in (1)! below. 


Translations are modeled after the interpretations the focus strategies come with. Small capi- 
tals, as in (1a), signals pitch accent, an intonation strategy English makes use of. A cleft struc- 
ture is chosen as a translation if the sentence conveys an exhaustive meaning, see (1b). 
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(1) a à Nfor(ci) mÈèbi ta 
Foc Nfor COMP 1sG FUT1 send 
‘I will send NFon? 
b. mà bí ta ba Nfor 
1SG FUT1 send Foc Nfor 


‘It is Nfor whom I will send’? 


The sentence in (1a) shows the focus marker 4, consistently followed by the 
fronted constituent that is focused, in turn followed by an optional element, 
which we label complementizer for now. Note that this element occurs exclu- 
sively with this type of focus construction, which we will address in more detail 
in $3.2. The second strategy, shown in (1b), involves the marker bá, which con- 
sistently occurs left adjacent to the focused constituent. 

We will show that the á construction, although appearing similar to cleft con- 
structions, does not mark contrastive/exhaustive focus, but rather information 
focus.? For exhaustive focus, only the bá strategy is felicitous.* This is rather sur- 
prising, since the structurally more complex construction with á and fronting of 
the focused constituent is used to convey the "simpler" kind of focus, i.e. focus 
without any additional semantic or pragmatic restrictions. This goes against the 
trend of focus marking observable cross-linguistically, where information focus 
is expressed with a canonical focus structure and contrastive focus with a rela- 
tively more marked structure (Zimmermann 2011). 82 briefly discusses semantic 
evidence for the focus constructions to necessarily express different types of fo- 
cus. In 83, we turn to the syntactic analysis of the á strategy where we argue 
against an underlying cleft structure and eventually adopt a feature-driven focus 
movement analysis along the lines of Cable (2010). 


2 Focus in Limbum: Interpretation 


Before we turn to the two constructions at stake, this section provides a brief 
overview of focus in general. Following Zimmermann & Féry (2010: 2), focus is 


? AII Limbum data in this paper are our own. They are based on the judgement of two native 
speakers of Limbum from Nkambe. 

"Note that Limbum also has the option of leaving focus completely unmarked. This strategy 
mostly patterns with á. The á strategy, however, imposes an existence assumption (Dryer 1996) 
on the context which is not required in the absence of a focus marking particle. For reasons of 
space, we cannot go into detail here, but see Driemel & Nformi (2018a). 

‘While bá necessarily expresses exhaustivity, it is not the only strategy Limbum offers to express 
such a type of focus. Example (17) presents a cleft structure which is also able to trigger an 
exhaustive interpretation. 
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“a classical semantic notion expressing that a focused linguistic constituent is se- 
lected from a set of alternatives”, i.e. focus marks the presence of alternatives 
(Rooth 1992; Krifka 2008). Focus is generally said to be involved in question- 
answer congruence, correction, and the marking of contrast, among other con- 
texts. 

The literature often distinguishes two main types, namely information focus 
and contrastive focus. The former signals the presence of contextual alternatives 
and often introduces new information. Therefore, we will use question-answer 
pairs to test for information focus. The latter type of focus comprises a num- 
ber of subtypes, all of which add semantic and/or pragmatic conditions on the 
alternatives laid out by the presence of focus. In this paper, we will consider: 


Information focus: marks the presence of alternatives 


(2) Whog stole the cookie? 
[PEter]g stole the cookie. 


Contrast: an explicit alternative is present; often within the same utterance 
(3) An [AMzrican]g farmer talked to a [CaNAdian]r farmer. 


Correction: an explicit alternative from a previous utterance is rejected by giving 
a new explicit alternative 


(4) [peter] stole the cookie. 
No, [MAry]r did it. 


Exhaustivity: one (set of) alternative(s) is selected; all non-selected alternatives 
are false (Szabolcsi 1981; Kiss 1998; Vallduví & Vilkuna 1998; Horvath 2010; 
2013), e.g. 


(5) Hungarian 
Anikóa templomba ment be, (máshová nem ment be). 
Anikó the church.into went in.PRv elsewhere not went in.PRV 


‘It was the cHURCH that Anikó entered (and nowhere else). 


"For reasons of brevity, we cannot discuss all possible types with respect to the focus strategies 
in Limbum in the present paper. To just name a few other important types, selection features an 
explicit set of alternatives, from which one or more alternatives can be chosen; exclusivity has 
one (set of) alternative(s) selected, where at least one of the non-selected (set of) alternative(s) 
is false (van der Wal 2011; 2014), or only stronger alternatives on some scale are false (Beaver & 
Clark 2008; Coppock & Beaver 2012); unexpectedness involves the selected alternative to stand 
out (Zimmermann 2008; 2011; Hartmann 2008; Skopeteas & Fanselow 2009; 2011; Frey 2010; 
Zimmermann 2011; Destruel & Velleman 2014). 
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In this section, we will look at three context tests that show how the two focus 
markers are felicitous in different contexts in Limbum. Then, we will address 
exhaustivity in more detail and provide evidence for ba involving exhaustivity, 
while á does not. 

New information can be modeled with the help of an inquisitive context. Imag- 
ine the following scenario: 


(6) Context: A and B are talking on the phone, the connection is really bad. A 
was telling B that she was going to meet someone, but B could not under- 
stand the person's name. B asks A to repeat whom she is going to meet. 
A: á Ngàlá (cí) | méàbí k5ni 

roc Ngala COMP 1SG FUT1 meet 
‘I will meet NGALA? 
A’: #mé bi koni ba Ngàlá 
1SG FUT1 meet Foc Ngala 


‘It is Ngala whom I will meet’ 


In such a context, A can clarify who she is going to meet with the 4 marker, 
but not use bd. The latter, as will be shown in (7) and (8), requires an additional 
contrastive component. 

Corrective contexts require an utterance with an explicit alternative, which is 
followed by another alternative in a second utterance, automatically canceling 
the first one. In such contexts, the ba strategy is obligatory: 


(7) Context: A bought a pair of shoes. B does not remember correctly and tells 
someone that A bought a dress. A corrects B saying that she bought shoes 
(instead). 

B: i ba yt cè? 
3sG PST2 buy dress 
‘She bought a dress: 
A: #a  blábá?(cí mé ba yú 
FOC shoes COMP 1SG PST2 buy 
‘I bought sports 
A’: méba yt bá blábá? 
1SG PST2 buy Foc shoes 
‘It is shoes that I bought: 
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In order to correct B’s statement, example (7) shows that ba now becomes licit, 

while á cannot be used to mark focus any longer in the presence of correction. 
A similar effect can be observed with the expression of contrast. Again, only ba 

is felicitous for contrasting two arguments, á being not acceptable in this context. 


(8) a. Tánkókí n5  mndzip Ngàlácí n5 bá hee 
Tanko HAB drink water Ngala but drink roc blood 
"Tanko drinks water but it is blood that Ngala drinks: 
b. *Tánkó ki n5  mndzipá Nee ci Ngàlá n 
Tanko HAB drink water roc blood but Ngala drink 


‘Tanko drinks water but Ngala drinks BLOOD? 


To test for exhaustivity, we will apply tests that have been proposed by Kiss 
(1998): combining exhaustively focused constituents with also or universal quan- 
tifiers is infelicitous since they both semantically contradict exhaustivity. As ex- 
amples (9B)° and (10B) show for the subject and object, respectively, the á strat- 
egy is able to occur with also scoping over the constituent in focus. The marker 
bá, on the other hand, does not allow for focused constituents including an also 
phrase, see (9B’)’ and (10B’). This behaviour is consistent with contrast and cor- 
rection scenarios, given in (8) and (7), i.e. contexts that involve exhaustivity. 


(9 A: Nfóà mū yü "kär, 

Nfor 3sG PST2 buy car 
‘Nfor bought a car. 

B: a Ngàláfóy à mū yü "kär 
FOC Ngala also 3sG PsT2 buy car 
‘NGALA bought a car, too: 

B’: #à mua yü ba Ngalarka fon. 

EXPL PST2 buy roc Ngala car also 


‘It was also Ngala who bought a car’ 


“While í encodes a 3sG pronoun, both à and í seem to function as 3sc subject markers, i.e. they 
can optionally co-occur with NP subjects. 3sG pronoun ícan be seen in (7B), (10B-B’), and (21a). 
3sG subject markers are realized either as à, see (9A-B), (10A), (14a), (16), (17), and (31), or as í 
see (13a), (18), and (19). 

7 As can be observed in (9), subject focus comes with an additional restriction for bá focused 
constituents, in that they can only occur postverbally. Glossing à as EXPL is only one option 
and might not be the most convincing one, since typical expressions involving expletives such 
as weather verbs, locative inversions, or existential constructions do not occur with à. An 
alternative is to analyze à as a default marker since it is identical to the 3sG subject marker. 


223 


Laura Becker, Imke Driemel & Jude Nformi Awasom 


(10 A: Nfo à mp yü rkar. 

Nfor 3sG PST2 buy car 
‘Nfor bought a car: 

B: a ntümntümfóg í mū yú. 
roc motorbike also 3sc Psr2 buy 
‘He bought a MOTORBIKE, too. 

B': #i mp ya bá ntümntüm fon. 

3sG PST2 buy Foc motorbike also 


‘It was also a motorbike he bought: 


Using a universal quantifier inside of the focused constituent, we get the same 
effect: the universal quantifier is incompatible with exhaustivity because it inher- 
ently makes reference to all alternatives from a set, whereas exhaustivity entails 
that some alternative is selected from the set, excluding others. Again, examples 
(11) and (12) illustrate for focused subjects and objects that á, as predicted, is com- 
patible with universal quantifiers, while bá is not: 


(1) a a ywé nsip(ci) a ba zhé baa subject focus 
FOC person all comp asc PST1 eat fufu 
‘EVERYBODY ate fufu? 
b *a ba zhebá nwé  nsip baa 
EXPL PST1 eat FOC person all fufu 


‘Tt is everybody who ate fufu’ 


(2) a á pwè nsip(c  màbí kant object focus 
Foc person all (comp)I FUT1 meet 
‘I will meet EVERYBODY: 
b. *mébi  koni ba wè  nsip 
I FUT1 meet Foc person all 


‘It is everybody that I will meet: 


3 The syntax of a 


Focused constituents that are preceded by the focus marker 4 have to occur 
clause-initially. They can be followed by what we have so far glossed as the com- 
plementizer ci. 
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(13) a. a Nfo (ci) í ba zhé baa subject focus 

Foc Nfor COMP 3SG PsT1 eat fufu 
"NFOR ate fufu? 

b. á Ngàlá (cí) | mé bí k5ni object focus 
roc Ngala COMP 1sG FUT1 meet 
‘I will meet NGALA? 

c. à  àyàgsé (cí) si bífü yé Shey adverbial focus 
FOC tomorrow COMP 1PL.INCL FUT2 see Shey 


“We will see Shey TOMORROW. 


Similar to many West African languages (Koopman 1984; Ameka 1992; Man- 
fredi 1997; Biloa 1997; Aboh 1998; 2006), verb focus in Limbum is realized by 
doubling of the verb. Note that the higher copy of the verb differs from the lower 
copy in that it is prefixed with a noun class marker.® 


(14) Verb focus:? 
a. a r-gwe(ci) ndap D à Ø  gwè intransitive 
Foc 5-fall comp house DET 3sG PERF fall 
"The house FELL? 
b. á r-yü (ci) njinwé D bí yú msay transitive 
FOC 5-buy COMP woman DET FUT1 buy rice 


"The woman will BUY rice? 


3.1 Against a biclausal structure 


As was shown in the previous section, the á strategy contrasts with the bá strat- 
egy in that it is compatible with non-exhaustive contexts. This provides our first 
argument against an underlying biclausal cleft structure, as those are typically 
found with an exhaustive meaning component (Horn 1981; Percus 1997). In this 
section, we provide three syntactic arguments against a cleft structure. 

Based on sentences like (15) in which á seems to act like a copula, Fransen 
(1995: 301) concludes that the high focus marker strategy constitutes a cleft. 


SNouns which are formed from verbs via prefixing of the noun class 5 marker r- are generally 
the gerundive form of the verb (Nformi 2017). In such derivations, the tone of the noun class 
prefix lowers the tone of the verb root if it is a H tone verb (14b). The infinitive form of the 
verb in the language also looks similar to the gerundive but differs in that it has the infinitive 
marker d. 

This focus construction cannot be used to express TAM focus. It can, however, express VERUM 
focus. 
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(15) ártee 
? palm.tree 


‘It is a palm tree: 


An alternative analysis of (15) takes copulas to be silent while á acts as a focus 
particle. This idea predicts copulas to show up as soon as they have to act as 
hosts for negation and/or tense affixes. Adding an overt tense marker to á is 
ungrammatical, see (16). As predicted, the only way to save the structure is by 
using a copula and an expletive, see (17). 


(16) (*mü) 4 (*mü) baa (ci) Nfo à ba zhe 
PST2 FOC PsT2fufu comp Nfor 3sc PST1 eat 


*Nfor ate FUFU: 


(17 à mt bà baa Nfó à mū zhe 
EXPL PST2 COP fufu Nfor 3sG PsT2 eat 
‘Tt was a fufu that Nfor ate? 


Our second and third argument concern the cleft clause. Extraposition (Akmajian 
1970; Gundel 1977; Percus 1997) as well as predicative approaches (Svenious 1998; 
Hedberg 2000; Reeve 2011) uncontroversially take cleft clauses to be embedded 
relative clauses. In Limbum, there is ample reason to doubt the existence of a 
relative clause in an á construction. While the complementizer cí is optionally 
spelled out following the focused constituent, it cannot, however, act as a relative 
pronoun. 
(18 ma zhi/*ci i mp zhéé mggàmbé 

child REL / COMP 3SG PsT2 eat plantains 

‘the child who ate plantains’ 
Furthermore, relative clauses can optionally co-occur with the right-headed 
demonstrative marker nd (Fransen 1995; Mpoche 1993), shown in (19). Crucially, 
the demonstrative is prohibited in the 4 strategy, see (20). 
(19) mū zhii mp zhéé mygdmbé (na) 

child REL 3sG PST2 eat plantains DEM 


'the child who ate plantains' 
(20) á  pkfau(cí mé bí k5ni (*na) 
Foc chief COMP 1SG FUT1 meet DEM 


‘I will meet the CHIEF, 
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To sum up, a biclausal cleft structure requires a copula and and a relative clause, 
neither of which seems to be present in the á construction. 


3.2 Focus movement analysis 


In line with what has been argued for question particles in Japanese (Hagstrom 
1998), Sinhala (Kishimoto 2005), and Tlingit (Cable 2010) on the one hand and 
focus fronting in Hungarian (Horvath 2007; 2010; 2013) on the other, we propose 
that the focus particle á merges with a constituent that is focused (or at least 
contains a constituent that is focused). The particle heads its own projection FP 
and bears an «F» feature. This feature projects up to FP enabling the contained 
constituent to be focused. A higher functional head, optionally spelled out as 
ci, probes for the feature, finds it on FP and, as a consequence, attracts FP (and 
everything contained in it) to its specifier, see Figure 1.!° 
The alternative proposal in which á itself spells out the focus head and attracts 
the focused constituent to its specifier, sketched in (21b), can be refuted based 
on the linear order of the structures: á would be predicted to follow the focused 
constituent, contrary to fact. An ad-hoc movement step of á to a higher (possibly) 
C or Force head is ruled out based on the behaviour of focused constituents in 
embedded clauses. 
(21) a í bà lá nè á rkárf» (ci)  ndü zhià m yú 
3SG PST1 Say COMP FOC car DET COMP husband her 3sc PsT3 buy 
‘She said that her husband bought the car’ 


b. * [vp [v lá]Lcp [c ng] [roce rkár f5 [Foc á] [Fin [rin Ci 1111] 


The complementizer nê would block movement of á to C, nevertheless á precedes 
the focused constituent. Hence, we assume the left periphery of the embedded 
clause in (21a) to be composed as shown in Figure 1. 

Support for the FP analysis comes from the fact that cícan only occur in clauses 
realizing the á strategy. Thus, cí seems to be tied to the presence of á focus. Under 
the account, presented in (21b), this obligatory co-occurrence would be a coinci- 
dence. Limbum, therefore, is strikingly different from Japanese, Sinhala, Tlingit, 


The exact nature of feature F and FocP and how they differ from focus on the contained con- 
stituents that needs to be interpreted is not entirely worked out in this paper. Based on the 
claim in footnote 3, it is possible to reanalyze F and FocP as triggers for movement that have 
a semantic impact, in the spirit of Horvath (2007; 2013). This analyzes will have consequences 
for the information focus status of the á strategy and its relation to contrastive focus, both of 
which are explored in Driemel & Nformi (2018a,b). 
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Figure 1: Focus movement of FP 


and Hungarian in that it provides overt evidence for both the locally merged par- 
ticle as well as the higher functional head which causes overt movement. The 
functional head must be different from C, since an additional complementizer 
can co-occur with ci, as (21a) shows. Moreover, ci can never act as a complemen- 
tizer on its own, it is dependent on the occurrence of 4. 

Limbum patterns with Tlingit, in that the particle takes the focused phrase as a 
complement rather than adjoining to it. FP as a projection of F bears the F-feature 
probed for by the Foc head. Since FP properly contains the focused phrase, the 
entire FP is expected to move to spec,FP, including possibly non-focused material. 
In other words, focus movement is predicted to pied-pipe. (22) shows the inability 
of possessors to be extracted by themselves, they obligatorily have to pied-pipe 
the possessum. 


(22) Context: A heard B telling someone on the phone that B would pick up 
someone’s brother from the bus station. A couldn’t properly understand 
whose brother B will pick up. 

A: a ndtr nda (ci) à bí Dn 
Foc brother who comp 2sG FUT1 pick.up 


"Whose brother will you pick up?' 
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B: à ndár Tánkó(c mébi Brī 
Foc brother Tanko comp 1sG FUT1 pick.up 
‘I will pick up TANKO's brother: 

B’: "A  Tánkó(cí mébi liri ndür 
Foc Tanko COMP 1sG FUT1 pick.up brother 


‘I will pick up TANKO's brother. 


An alternative account like the one shown in (21b) cannot predict pied-piping 
without assuming further constraints on movement. Whichever phrase is fo- 
cused, and thus bears an F feature, would be predicted to move to spec,FocP, 
see (23) for an illustration. 


(23) *[.. ái [rocp [pp Tánkór]» [roc t1] [TP + Lop Lo’ [np ndár] Øp] t]..]] 
t * j 


In the current analysis the FP is the closest goal the Foc head sees. It is therefore 
the entire FP that gets attracted to the specifier of FocP, making it impossible for 
a focused phrase contained in an FP to move to spec,FocP on its own, see (24). 


(24)  [rocP , , [Foc cí] [rp --- [Fp á [pp [bv [np ndúr] Øp] Tänkér]]...]] 


A | 


Extractions of the type shown in (23) can potentially be ruled out by general con- 
straints on movement since they seem to be marked cross-linguistically (Corver 
1990; Boëkovié 2005). We would like to point out, however, that possessor ex- 
traction is not banned per se, since it is allowed in topic configurations, shown 
in (25a), albeit with a resumptive pronoun. A base-generation approach seems 
implausible since topicalization is less acceptable out of islands, shown e.g. in 
(25b) for a complex noun phrase. 


(25) a. à mbò Tanko, më mū yé nfi zhi 
as for Tanko isc PST2 see brother his 
‘As for Tanko, I met his brother? 
b. ?à mbò Tanko, mè rin pwe zhi mū kóní nfi zhi 
as for Tanko 1sc know man REL PST2 meet brother his 
‘As for Tanko, I know a man who met his brother 
Since the possessor can, in principle, move out of the DP it is contained in, we 


conclude that it must be the focus particle 4 merged with the entire DP that 
prevents the possessor from moving to spec,FocP alone. 
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Another environment in which we can observe the pied-piping property of fo- 
cus movement concerns prepositional phrases, shown in (26). Prepositions can- 
not be stranded if the phrase they merge with is narrowly focused. 


(26) Context: A heard B telling someone on the phone that B shot an animal 
with something but it is not clear to A with what. 
A: á ni kē (ci) wé mi ta nya à? 
FOC with what COMP 2sG PsT2 shoot animal Q 
"With what did you shoot the animal?’ 
B: á ni ngär(ci) mà mü tà nya 
FOC with gun COMP 1SG PST2 shoot animal 
‘I shot the animal with a GUN: 
B’: “à ngär(ci) mé mü tà  nyà m 
FOC gun COMP 1SG PST2 shoot animal with 
‘I shot the animal with a GUN: 
Similar to the possessor case, the alternative account in which the focus particle 


á spells out the FOC head would predict the complement of P to be attractable 
to spec,FocP, in case it is the constituent that carries the F feature. 


(27) * [... ài [rocp [pp ngarrl» [Foc t1] [re --- [pp [p ni] tel 


( S j 


In contrast, the FP analysis predicts FP to be the goal that checks the F feature 
on the FOC head. Hence, the entire PP has to move to spec, FocP. 


(28) [roc , [Foc cí] [rp --- [Fp á [pp [p ni] ngārr]..]] 


a j | 


Again, conditions on preposition stranding can be independently motivated, 
since this kind of movement seems to be banned in a number of languages (Abels 
2003; Heck 2008). The FP analysis, however, offers an explanation for the lack 
of preposition stranding and possessor extraction simultaneously. 

At this point, it is important to answer the question why the focus particle can- 
not merge directly with the narrowly focused constituent in (24) and (28). Here 
we follow Cable (2010) by adopting the QP-Intervention Condition reformulated 
for FPs. 
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(29) FP-Intervention Condition: (adapted from Cable 2010: 57) 
An FP cannot intervene between a functional head o and a phrase 
selected by a. (Such an intervening FP blocks the selectional relation 
between a and the lower phrase.) 


By assumption, functional heads c-select for their arguments, while lexical heads 
s-select for their arguments (Cable 2010: 62). An FP can intervene between a lex- 
ical head and the phrase selected by that head because the F particle does not 
change the semantic type of the phrase it merges with. An FP cannot, however, 
intervene between a functional head and the phrase it selects for since the F par- 
ticle indeed changes the category of the phrase it merges with. Hence, á cannot 
merge with the embedded XP of a prepositional phrase because it would inter- 
vene between the functional head P and XP. Neither can á directly merge with a 
possessor because the functional element D c-selects its possessor and á would 
act as an intervener. 

Further support for (29) comes from VP-fronting, here analyzed as remnant 
vP-fronting. If á were to take vP as its complement, the particle would intervene 
between vP and the higher functional head T. As a consequence, VPs cannot (per 
se)!! be focused with the á strategy. 

(30) A: á ke (c njinwé f5 bí à 
FOC what COMP woman DET FUT1 Q 
"What will the woman do?' 
B: “4 (r)yü msän (ci) njinwé f5 bi 
FOC 5-buyrice COMP woman DET FUT1 


"The woman will Buy RICE? 


While the ban on P-stranding and possessor extraction might be reducible to 
the interplay of the prc (Chomsky 2000) and Anti-locality (Abels 2003; Erlewine 
2016), this crucially does not apply to the lack of VP fronting because TPs are 
uncontroversially denied phasehood status. The impossibility to front a VP in 
(30), thus, requires an independent explanation. In contrast, the FP analysis can 
capture all three properties of the á strategy. 


"There is, however, a way to repair the structure using do-support: 


(i) á r-ya msan(ci) njinwé f5 bí gi 
Foc 5-buy rice COMP woman DET FUT1 do 


"The woman will BUY RICE. 


At this point, it is unclear to us why do-support is able to save the construction. 
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Finally, a note on verb focus is in order. As (14a) and (14b) show, verb focus 
requires doubling on the one hand and a noun class marker prefixing the higher 
copy on the other hand. The latter suggests that the focus particle 4 c-selects 
for nominal phrases, so that verbs have to be nominalized in order to be merged 
with 4. The behaviour of the focus particle is not unusual for Limbum since co- 
ordinators seem to make the same kind of distinction. As (31) shows, the choice 
of coordinator correlates with the categories of the conjuncts.l? 


(3) a Sheya mū ró Njobe bá Shey 

Shey 3sG PsT2 search Njobe and Shey 
‘Shey searched for Njobe and Shey. 

b. Sheya mū ró Njobe mantaa ba ko la? 
Shey 3sG PsT2 search Njobe at market and at home 
‘Shey searched for Njobe at the market and at home: 

c. Sheyà mt cana gwè 
Shey 3sc PsT2 run and fall 
‘Shey ran and fell? 


Since categorical sensitivity shows up elsewhere in the language, we tentatively 
conclude that the noun class prefix in verb focus constructions is due to a selec- 
tional restriction of 4. Attaching a noun class prefix to one of the copies could 
potentially serve as a reason for multiple spell out, i.e. doubling. A detailed anal- 
ysis, however, is still missing and left for future research. 


4 Summary and future work 


In this paper, we have shown that the two focus strategies in Limbum, involv- 
ing two different markers, also clearly differ in their functions: the marker á is 
linked to information focus (i.e. focus with no further semantic/pragmatic con- 
ditions), while bá occurs in contexts that involve contrast and exhaustivity. The 
interpretation effects that the á strategy triggers are compatible with the syn- 
tactic analysis: the lack of tense marking on copulas, the behaviour of the com- 
plementizer cí, and the ban on right peripheral demonstrative markers provide 
evidence against an underlying cleft structure. The current proposal, therefore, 
models á focus as focus movement, where the focus particle is directly merged 


”Limbum shows a great deal of homophony (compare also the use of cías a sentence coordinator 
in (8) vs. the general use of cí with respect to focused á phrases), which could account for the 
fact that á and bá can act as coordinators as well as focus particles. Alternatively, coordinators 
and focus particles could also be related diachronically. This issue must be left open for now. 
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with the focused phrase and attracted to the left periphery by a higher functional 
head, pied-piping the focused constituent. While this type of analysis has been 
proposed for other languages (Hagstrom 1998; Cable 2010), albeit for questions, 
Limbum crucially provides morphological evidence for the existence of a particle 
(á) as well as the higher functional head (ci). 

Even though a cleft analysis is ruled out, the Limbum patterns, shown in this 
paper, nevertheless present a so-far unattested opposition of focus strategies: in- 
formation focus, being less marked semantically, is expressed by a complex strat- 
egy consisting of a particle and fronting, whereas contrastive/exhaustive focus, 
although imposing additional semantic restrictions on the focus alternatives, is 
realized by a particle only. The reasons why Limbum shows the reverse picture 
in terms of structural markedness and complexity of interpretation need to be 
explored further in future work. 

One last point concerns the syntax of bá. In contrast to the á strategy, the 
bá construction does not seem to provide overt evidence for the existence of a 
higher functional head. The behaviour of focused subjects, however, indicates 
certain positional restrictions a focused phrase has to obey. Future work will 
explore whether the FP analysis can be extended to the bá strategy. 
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Abbreviations 

1,2,3 1st, 2nd, 3rd person PL Plural 

1-,2-,5-, Noun classes PREP Preposition 

come  Complementizer PRV Preverb 

COP Copula PST1 Recent past tense 

DET Determiner PST2 Distant past tense 
DEM Demonstrative PST3 Remote past tense 
EXPL Expletive REL Relative pronoun 

FOC Focus marker SG Singular 

FUT1 Near future tense : High tone 

HAB Habitual ; Low tone 

INCL Inclusive À Mid tone 

PERF  Perfective 


P Although see Driemel & Nformi (2018a) for a possible explanation. 
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Universal quantification in the nominal 
domain in Kihehe 


Kelly Kasper-Cushman 


Indiana University 


This study provides a description of the universal quantifiers mbe-AGR2-li ‘all’ and 
kila ‘every’ in Kihehe, a Bantu language spoken in south-central Tanzania. Follow- 
ing a description of general properties of these quantifiers, this study analyzes how 
the Kihehe data bear on the phenomena of collectivity and distributivity, and situ- 
ates the Kihehe data with regard to recent crosslinguistic work on the typology of 
quantifiers (Matthewson 2013). 


1 Introduction 


This paper presents a description of the universal quantifiers mbe-Acn2-li ‘all’ 
and kila ‘every’ in Kihehe (G.62; Maho 2009), a Bantu language spoken in south- 
central Tanzania around the town of Iringa. Regarding quantification in Bantu, 
Zerbian & Krifka (2008: 383) note that “few studies exist which touch upon quan- 
tification [in (whatever) Bantu languages]” (but see Landman 2016; 2019 for two 
recent works). Additionally, this documentation contributes to an understanding 
of quantifiers crosslinguistically, and to the development of a typology of quan- 
tifiers, as called for by Matthewson (2013) (for recent studies on quantification 
in natural languages, see the works in Gil et al. 2013; Keenan & Paperno 2012; 
Matthewson 2008 and Paperno & Keenan 2017). 

82 and §3 describe general properties of mbe-Acn2-li ‘all’ and kila ‘every, re- 
spectively. 84 considers how the data in Kihehe bear on the properties of col- 
lectivity and distributivity associated with the universal quantifiers (see, e.g., Sz- 
abolcsi 2010). Finally, 85 situates the Kihehe data with regard to three crosslin- 
guistic typological generalizations of quantifiers put forth in Matthewson (2013). 


Kelly Kasper-Cushman. 2019. Universal quantification in the nominal domain in Kihehe. In 
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2 Mbe-AGR2-li ‘all’ 


This section provides an overview of the universal quantifier mbe-AGR2-li ‘all’! 
It starts with a discussion of general agreement patterns before providing a de- 
scription of coordination and the partitive construction. 


2.1 General properties 


The quantifier mbe-AGR2-li follows the noun it modifies and agrees with it in 
noun class, as shown in (1) in which it modifies the plural Class 2 noun vanu 
‘people’ The agreement marker, labeled AGR2, is an infix. 


(1) va-nu | mbé-va-li 
2-person all-2-all 


‘all people'/'all of the people"? 


The morphological form of the agreement marker for this quantifier patterns 
with that for demonstratives, numbers, and possessives, as shown in (2a). This is 
unlike the morphological form of the agreement marking on adjectives, which 
is identical in form to the noun class marker, as in (2b) (this latter pattern in- 
cludes the quantificational adjectives -olofu ‘many/much; -ongefu ‘many/much; 
and -kefu ‘few’; however, -ngi ‘other’ patterns with demonstratives). When other 
modifiers are present in the NP, as in (2a), mbe-AGR2-li appears last. 


(2 a. i-mi-biki gi-tayi mbe-ge-li 
PPF-4-tree 4-four all-4-all 
'all four trees' 
b. i-mi-biki my-ongefu 
PPF-4-tree 4-many 


€ H 
many trees 


A singular noun can be modified by mbe-aGr2-li. Example (3) shows a noun 
from Class 3; here, all + N is translated as ‘the whole N° 


"The meaning of the individual components mbe- and -li is not clear; these components always 
appear together with an infixed agreement marker in this construction. 

?Tone is marked on examples where transcription is reliable. Data are transcribed using ortho- 
graphic conventions based on Kiswahili. Vowel length is not typically marked. Syllabic nasals 
are marked. 
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(3) m-víli mbé-gu-li 
3-body all-3-all 
‘a/the whole body’ 


Mbe-AGR2-li can take locative noun class agreement. With the Class 16 agree- 
ment (pa-), for example, the quantifier translates as ‘all of the time’ 


(4 mbé-pa-li 
all-16-all 
‘all of the time’ 


The quantifier can agree with a plural personal subject (see also Jerro 2013). 


(5 Tu-bit-e | mbé-tu-li! 
1PL-go-IMP all-1Pr-all 


‘Let’s all go!’ 


There is no separate lexical item to express both; mbe-AGR2-li is used in this 
context. If disambiguation is necessary, the number -vili 'two' is inserted, as in 


(6). 
(6) A-v-ana  va-vili mbe-va-li va-tamw-a. 
PPF-2-child 2-two all-2-all 2-be sick-Fv 
*Both of the children are sick? 


In citation form, the head noun modified by mbe-AGR2-li can optionally occur 
with the pre-prefix. However, when the head noun is an argument, the head noun 
must appear with the pre-prefix; compare example (7) with example (1); see also 
the discussion in $5 and Gambarage (2016). 


(7) A-v-ána  mbé-va-li va-tamw-a. 
PPF-2-child all-2-all 2-be sick-rv 
‘All of the children are sick? 


Note that the pre-prefix is required even if a demonstrative is present as in (8) 
(see also the discussion in Matthewson 2013: 32-33). 


(8) A-va-na  i-v-o mbe-va-li va-tamw-a. 
PPF-2-child PROX.DEM-2-ADD all-2-all 2-be sick-rv 
‘All of those children are sick? 
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Mbe-AGn2-li can stand alone as a pronoun, as shown in (9-10). Example (10) 
shows a copular structure with a predicate noun. 


(9) Mbé-se-li si-fw-e. 
all-10-all 10-die-PsT.Fv 
‘All (of them) died: 


(10) Mbé-va-li v-ana. 
all-2-all 2-child 
‘All are children? 


Finally, an example of the universal quantifier in a text is given in (11). The ex- 
ample is from Luwuko (2011: 9:25), the Kihehe translation of the Book of Exodus. 


(11) Kihehe (Luwuko 2011: 9:25) 
Na i-ndonya i-nya-ma-ganga y-a-denyanz-ile 
and PPr-9.rain PPF-of-6-stone 9-P3-strike down-PsT 
i-fi-melo mbe-fi-li i-fy-a mu-mi-gunda, 
PPF-8-seedling all-8-all PPr-8-Assoc Loc=4-field, 
y-a-nanz-ile i-mi-biki mbe-gi-li i-j-a 
9-p3-destroy-PsT PPr-4-tree all-4-all PPr-4-Assoc 
mu-mi-gunda. 
Loc-4-field 
"Then the hail struck down all of the seedlings in the fields, and 
destroyed all of the crops in the fields: 


2.2 Coordination with mbe-AGR2-li 


In a coordinate structure such as all of the N; and No, mbe-aGr2-li can modify 
either N; or both N; and Nj. Mbe-Acn2-li can be placed after either the first 
noun or the second noun in the coordinate structure and have the interpretation 
‘all N; and all N3; although the most natural reading will be that of mbe-AGR2-li 
modifying the closest N (compare 12a with 12b). To disambiguate, mbe-Ac2-li can 
be placed after both nouns, as in (12c). 
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(12) a. Structure: all N, and N, 

a-va-kwámisi mbé-va-li n(a) a-vá-híinza 
PPF-2-boy  all-2-all and ppr-2-girl 
Most natural interpretation: 'all of the boys and some (not all) of the 
girls’ 
Possible interpretation: ‘all of the boys and all of the girls’ 

b. Structure: N; and all N3 
a-va-kwamisi n(a) a-va-hiinza mbe-va-li 
PPF-2-boy and ppr-2-girl all-2-all 
Most natural interpretation: ‘some (not all) of the boys and all of the 
girls’ 
Possible interpretation: ‘all of the boys and all of the girls’ 

c. Structure: all N; and all N3 
a-va-kwamisi mbe-va-li n(a) a-va-híinza mbe-va-li 
PPF-2-boy  all-2-all and rrr-2-girl all-2-all 
‘all of the boys and all of the girls’ 


In the examples in (12), both nouns in the coordinate structure belong to the 
same noun class. Nouns from different noun classes can be coordinated using 
the structure all of the N; and N», but the most natural interpretation of that 
structure is ‘all ofthe N; and some of the N3? When the intended meaning is ‘all 
of the N; and all of the N3; it is more felicitous to modify both nouns with all. 


(13 a. a-va-nu mbe-va-li n(a) i-senga 

PPF-2-person all-2-all and PPr-10.cow 
Most natural interpretation: 'all of the people and some (not all) of 
the cows' 
Possible interpretation: "all of the people and all of the cows' 

b. a-va-nu mbe-va-lin(a)i-senga  mbe-se-li 
PPF-2-person all-2-all and ppr-10.cow all-10-all 
'all of the people and all of the cows' 


2.3 Partitive constructions with mbe-AGR2-li 


In English, there is a surface difference between quantified nouns in partitive 
constructions (e.g., all of the N) and non-partitive constructions (all N); for ex- 
ample, I went to a party last night and talked to "all linguists/all of the linguists 
(example adapted from an example in Matthewson 2001: 170; see Matthewson 


243 


Kelly Kasper-Cushman 


2001, and Matthewson 2013, for a discussion of the debate regarding the parti- 
tive construction and the semantic denotation of quantifiers). In Kihehe, there 
is no overt morphological element corresponding to of in English partitive con- 
structions. Additionally, all nouns modified by mbe-AGn2-li must first combine 
with the pre-prefix. Thus, we find a lack of surface contrast between contexts 
in which the quantified noun is contextually restricted versus non-contextually 
restricted, for both count nouns, as in (14), and mass nouns, as in (15). 


(14) a. Generic 
Lsenga  mbe-se-li s-ay-i-ly-(a) a-ma-soli. 
PPF-10.cow all-10-all 10-HAB-PRs-eat-FV PPF-6-grass 
‘All cows do eat grass’ 
b. Contextually restricted 
Ke-nyele ku-m-nada n-gus-is(e) i-senga  mbe-se-li. 
AUX=1SG.g0.PST LOC-3-auction 1sG-sell-PsT PPr-10.cow all-10-all 


‘I went to the auction and sold all of the cows: 


(15 a. Generic 
Nd-i-wend-(a) i-fy-ayi mbe-fe-li. 
1SG-PRS-like-rv PPF-8-tea all-8-all 
‘T like all tea’ 


b. Contextually restricted (Context: I made two pots of tea a while ago.) 
Lfy-ayi mbe-fe-li fy-e=nelike fi-2-pos-ile. 
PPF-8-tea all-8-all 8-REL-1sG.cook.PsT 8-Pi-be cool-PsT 
‘All of the tea which I made is cold: 


3 Kila ‘every’ 


This section presents a general overview of the universal distributive quantifier, 
kila, ‘every. Kila is a borrowing from Kiswahili (which borrowed it from Arabic; 
Zerbian & Krifka 2008). It is unknown how recently kila was borrowed into Ki- 
hehe, but evidence suggests that kila is a recent borrowing. It does not appear 
in Spiss’s (1900) Kihehe-German dictionary, and it is not used in the Luwuko 
text. However, kila is used commonly by Kihehe speakers, except for the oldest 
generation. 
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3.1 General properties 


Similar to what Zerbian & Krifka (2008) note for Kiswahili, in Kihehe kila must 
appear before the noun it modifies. It does not show agreement in noun class, 
and the noun modified by kila may not occur with the pre-prefix.? 


(16 a kila mw-ana 
every 1-child 
‘every child’ 
b. * mw-ana kila 
1-child every 
‘every child’ 
c. “kila u-mw-ana 
every PPF-1-child 


‘every child’ 


Kila can modify both singular nouns, as in (16a), and plural nouns as in (17). In 
(17), kila N is interpreted as ‘every group of N’ (17). 


(17) Kila va-nu  v-ay-i-pig-ag-a i-kasi kwa u-m-twa. 
every 2-person 2-HAB-PRS-work-DUR-FV PPF-9.work for PPr-1-chief 


‘Every group of people does work for the chief? 


Finally, note that kila cannot stand alone as a pronoun. 


3.2 Kila in coordinate structures 


In general, kila seems to resist coordination. Structures such as every N; and N3 
without the repetition of kila were judged to be unnatural when the intended 
meaning was every N; and every N3. Additionally, for several contexts created 
to elicit coordination with kila, coordination with mbe-AGn2-li was judged to be 
more natural (or the only coordination structure possible). However, I was able 
to elicit a coordinated structure in the form of every N} and every N, which my 
native speaker consultant judged to be natural. This context would also permit 
the quantifier mbe-AGR2-li. Note that in the example (18a) with kila, the two coor- 
dinated nouns contain relative clauses, while in example (18b) with mbe-acr2-li, 
the coordinated nouns do not. 


?Steve Franks and an anonymous reviewer both point out that kila and the pre-prefix are in 
complementary distribution. This suggests that a quantified noun with kila is a Q + NP struc- 
ture, while quantification with mbe-AGR2-li is a Q + DP structure. This is discussed with regard 
to crosslinguistic generalizations in $5. 
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(18) Context: I went to Tanzania. While I was there, I went on a safari in a 
mbugayawayama (a Kiswahili word for ‘reserve’ or ‘park’). In the park... 
a. Nd-a-tov-ige i-picha y-a kila m-biki 

1SG-P3-beat-DUR.PST PPF-9.picture 9-Assoc every 3-tree 

gw.e=nd-a-gu-won-ige na kila ki-koko 

3.REL-1SG-P3-3-see-DUR.PST and every 7-animal 

ch.e-nd-a-ki-won-ige. 

7.REL-1SG-P3-7-Ssee-DUR.PST 

'I took a picture of every plant that I saw and every animal that I saw: 
b. Nd-a-tov-ige i-picha y-a i-mi-biki 

1S-P3-beat-DUR.PST PPF-9.picture 9-Assoc PPF-4-tree 

mbe-ge-li n(a) i-fi-koko mbe-fe-li 

all-4-all and ppr-8-animal all-8-all 

fy.e=nd-a-fi-won-ige. 

8.REL-1SG-P3-8-see-DUR.PST 


‘I took a picture of all the plants and animals that I saw: 


3.3 Kila: Like every or like each? 


There is no separate lexical item in Kihehe expressing each. For English, linguists 
have investigated several differences between each and every; three of these dif- 
ferences are summarized in Table 1. Considering these differences, one can ask if 
the uses of kila in Kihehe correspond more to the syntax and semantics of each, 
or more to those of every. 


Table 1: Some properties of each and every (Beghelli & Stowell 1997) 


Property each every 

Ability to float Can float Cannot float 

Modification by ‘almost’ Cannot be modified by Can be modified by 
almost almost 

Availability of Generic each N cannot be every N can be 

interpretation understood as a generic understood as a generic 


For the properties in Table 1, kila patterns with every, rather than each. First, 
kila cannot float. It can appear in only one position, preceding the noun it modi- 
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fies. Second, as shown in (19), kila can be modified by kalibiya ‘almost; which is 
a borrowing from Kiswahili. 


(19) Kalibiya kila ki-nu ki-sis-ile. 
almost every 7-thing 7-be used up-PsT 


‘Almost every thing is gone. 


Third, kila can be used to make generic statements, as shown in example (20). 


(20) Kila senga y-ay-i-ly-(a) a-ma-soli. 
every 9.cow 9-HAB-PRS-eat-FV PPF-6-grass 


€ > 
Every cow eats grass. 


«ce 


In addition to these properties, Vendler (1962: 150) states that "each. direct 
one’s attention to the individuals as they appear, in some succession or other, one 
by one” Consider example (21), based on a similar example in Vendler (1962: 150). 
The example suggests that in Kihehe, when the context establishes a one-by-one 


interpretation, kila becomes infelicitous.* 


“An anonymous reviewer notes that this example “suggests that Kihehe employs the typolog- 
ically widespread reduplication strategy for expressing ‘strong distributivity’ (in the sense of 
Beghelli & Stowell 1997)” and wonders if the reduplication method for expressing strong dis- 
tributivity blocks kila in appearing in such contexts. The use of reduplication to indicate dis- 
tributivity is also supported by the Kihehe data in (i), in which reduplication gives rise to an 
unambiguous distributive meaning of “2 mangos per child” (on reduplication and distributivity, 
see also Balusu & Jayaseelan 2013). 


(i) M-pel-e kila mw-an(a) a-ma-yembe ga-vili ga-vili. 
l-give-IMP every l-child  PPF-6-mango 6-two 6-two 
‘Give every child 2 mangos each’ 
A final point of comparison between English every and kila: Beghelli & Stowell (1997: 98) 
note that every allows collective readings, as in It took every boy to lift the piano (their exam- 


ple 33a). Kila does not allow collective readings. This is discussed in the following section in 
conjunction with example (26) and footnote 6. 
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(21) Context: After I purchased a basket of mangos, I noticed that some of them 
were starting to rot. I want you to look at each one carefully so that you 
can sort the mangos into those that are still good to eat, and those that are 
spoiled. I tell you: 


a. #Li-lav-e kila li-yembe. 
5-see-IMP every 5-mango 
'Examine every mango: 

b. oKli-lav-e  lí-mwí-lí-mwi. 
5-see-IMP 5-one-5-one 


‘Examine (them) one after another 


4 Collectivity and distributivity 


This section discusses the properties of collective and distributive interpretations 
with the universal quantifiers in Kihehe. Regarding -ote ‘all’ and kila in Kiswahili, 
a reviewer of Zerbian & Krifka (2008: 396) "suggests kila is inherently distributive 
whereas -ote is underspecified concerning the distributive/collective distinction? 
This section presents data which suggest that the behavior of quantifiers in Ki- 
hehe are compatible with the hypothesis that mbe-AGn2-li is underspecified for 
collectivity vs. distributivity, while kila is distributive. 


4.1 Distributive contexts 


In the contexts that are distributive, mbe-AGR2-li is acceptable. For the context 
presented in (22), kila is also acceptable as shown in (22b).° 


(22) Context: The chief was sitting in the front of the hall. Before the meeting 
started, the villagers went up individually to greet the chief. All of the 
villagers present had to greet the chief before the meeting started. Then at 
last, the meeting began. This evening... 


a. a-va-nu mbe-va-li va-mu-@-hunje u-mu-twa. 
PPF-2-person all-2-all 2-1-p1-greet.PsT PPr-1-chief 


'all of the people greeted the chief? 


"An anyonymous reviewer asks why kila is felicitous here (22b), while it is not felicitous in 
either (21b) or (23b). One possibility is that the context in (22) does not impose that the acts 
of greeting were conducted in strict succession, one after another. More research is needed 
to clearly define the properties of the contexts which permit or require one strategy (kila or 
reduplication) versus another. 
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b. Kila mu-nu a-m-@-hunje u-mu-twa. 
every 1-person 1-1-P1-greet.PsT PPF-l-chief 


'every person greeted the chief? 


However, kila is not acceptable in all distributive contexts. In example (23), 
mbe-AGR2-li is possible, but kila is not felicitous. As with example (21), the con- 
text creates an interpretation of ‘individually, one after another: 


(23) Context: The boys in the village were arguing about who was the strongest. 
Then one boy saw a big rock and said "Whoever throws the rock the far- 
thest is the strongest.” So one at a time, the boys threw the rock. What 


happened? 
a. A-va-kwamisi mbe-va-li v-a-hom-it(e) i-li-ganga. 
PPF-2-boy all-2-all | 2-P3-throw-PsT PPF-5-stone 


‘All of the boys threw the rock 
b. #Kila m-kwamisi (a)-a-hom-it(e) i-li-ganga. 
every 1-boy 1-P3-throw-PsT PPF-5-stone 
‘Every boy threw the rock? 
c. OKU-mu-nu yu-mwi-yu-mwi (a)-a-hom-it(e) i-li-ganga. 
PPF-1-person 1-one-1-one 1-P3-throw-PsT PPF-5-stone 


"The people threw the stone one after another. 


4.2 Collective contexts 


In contexts that are necessarily collective, only mbe-AGR2-li is felicitous. The 
use of kila creates distributive interpretations that are not compatible with the 
contexts. 


(24) Context: A fire destroyed someone's house in the village. The person was 
elderly and could not rebuild her house, so the other villagers decided to 
help her. This morning... 


a. a-va-baba mbe-va-li v-a-senz-ile — kangi i-nyumba. 
PPF-2-man all-2-all —2-P3-build-PsT again PPr-9.house 
y.e=ke=yi-ka-pye 
9.REL-AUX-9-P2-burn.PsT 
'all the men rebuilt the house which had burned: 
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b. #kila m-baba (a)-a-senz-ile kangi i-nyumba 
every 1-man 1-P3-build-PST again PPr-9.house 
y.e=ke=yi-ka-pye 
9.REL-AUX-9-P2-burn.PsT 
'every man rebuilt the house which had burned: 
Comment from native speaker consultant: This would have to mean 
that there were many houses being built. 


Szabolcsi (2010: 121) states that ^To test collective readings it is advisable to em- 
ploy punctual accomplishment verbs such as lift up, as opposed to lift, which has 
an activity reading? Based on this statement, the context in (25) was designed to 
test collective readings with the verb kunyanyula, ‘lift up? The consultant's judg- 
ments indicate that mbe-AGR2-li is acceptable with a collective reading, while 
kila requires a distributive reading. 


(25) Context: There was a child trapped under a wagon that had fallen over in 
the road. The child screamed, and people rushed to the wagon. In order to 
rescue the child... 


a. a-va-nu mbe-va-li va-S-nyanyuw(e) i-li-tololi. 
PPF-2-person all-2-all 2-p1-lift up.PsT — PPrF-5-wagon 
'all of the people lifted up the wagon: 
b. #kila mu-nu a-@-nyanyuw(e) i-li-tololi. 
every 1-person 1-P1-lift up.PsT PPF-5-wagon 
'every person lifted up the wagon: 
Comment from consultant: This would mean that everyone has his/her 
own wagon. 


However, as Robert Botne (pers. comm.) points out, we cannot be sure that 
kunyanyula is necessarily a punctual accomplishment verb (i.e., is it interpreted 
as ‘lift up’ or ‘lift’?). Additional evidence that kila requires a distributive interpre- 
tation while mbe-AGR2-li is acceptable with a collective reading comes from the 
pair in (26), modeled after the data in Zerbian & Krifka (2008: 396-397). These 
data also show that similar to what Zerbian & Krifka (2008: 397) describe for 
Kiswahili, in Kihehe kila is not grammatical with pamwi ‘together’ as in (26b).° 


"Note also that it is ungrammatical to add pamwi ‘together’ to the utterance in example as in 


(25b). 
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(26) a.  A-va-kwamisi mbe-va-li v-a-lavis-igé a-ma-shindano 
PPF-2-boy all-2-all 2-p3-watch-DUR.PST PPF-6-competition 
pamwi. 
together 
‘All of the boys watched the competition together: 

b. *Kila m-kwamisi (a)-a-lavis-igé a-ma-shindano pamwî. 
every 1-boy 1-P3-watch-DUR.PST PPF-6-competition together 


‘Every boy watched the competition together’ 


5 Kihehe universal quantifiers and cross-linguistic 
generalizations 


Matthewson (2013) calls for work towards a typology of quantifiers, and puts 
forth eight crosslinguistic generalizations based on a survey of 37 different lan- 
guages. In this section, I situate the Kihehe data with respect to three of the 
crosslinguistics generalizations that bear on syntactic differences among the uni- 
versal quantifiers.’ 

First, consider Matthewson’s Generalization 6: 


(27) Gen6: It is common for a word translated as ‘all’ to look as if it attaches to 
a full DP, even when other quantifiers do not. (Matthewson 2013: 35) 


As described above, the Kihehe data show that mbe-aGr2-li must combine 
with a head noun + pre-prefix when the head noun is an argument. The data are 
consistent with the hypothesis that the pre-prefix on nouns is a D, and one of 
its functions is to combine with NPs to form DP arguments. I come to this hy- 
pothesis applying the argumentation laid out in Matthewson (2001). Gambarage 
(2016) arrives at a similar analysis of the pre-prefix in Nata, a Bantu language 
also spoken in Tanzania; see his work for a much more exhaustive investigation 
of this claim as well as for references of number of authors who argue that the 
pre-prefix is a D; see Longobardi (1994) regarding Ds forming DP arguments. 

In support of this hypothesis, note that in Kihehe, the head noun of a DP ar- 
gument containing the quantifier mbe-AGR2-li must appear with the pre-prefix. 
Main clause quantified DPs are ungrammatical without the pre-prefix (28). 


"Landman has a similar goal for her (2016) study of Logoori; she finds that the generalizations 
from Matthewson (2013) under consideration for her study apply to the distributive quantifier 
vuri ‘every, but not to the quantifier -oosi ‘all? ultimately analyzing -oosi as a DP-internal 
modifier. Whether this analysis could extend to Kihehe mbe-AcR2-li is left an open question 
for future research. 
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(28) *Sénga mbé-se-li si-fwe. 
10.cow all-10-all 10-die.PsT 
‘All the cows died: 


While DP arguments must contain a pre-prefix on the head noun, the pre- 
prefix cannot occur on a predicate noun or a predicate adjective. When the pre- 
prefix does occur in these contexts, it creates a relative clause (29-30). 


(29 a. U-mu-nu  yu-la mu-twa. 
PPF-1-person 1-DEM.DIST 1-chief 
"That person is a/the chief? 
b. u-mu-nu  yu-la u-mu-twa 
PPF-l-person 1-DEM.DIST PPF-1-chief 


‘that person who is a/the chief’ 


(30) a. A-v-ana va-tali. 
PPF-2-child 2-tall 
‘The children are tall’ 
b. a-v-ana a-va-tali 
PPF-2-child ppF-2-tall 


‘the children who are tall’ 


Pre-prefixes are not needed on mass nouns when stating general information 


(31). 


(31) Wu-lasi wu-nono. 
14-wulasi 14-sweet 


*Wulasi is sweet: 


The hypothesis that a DP noun argument must be marked with a pre-prefix 
when combing with the universal quantifier is strongly supported by the Kihehe 
data. However, one possible counterexample to this hypothesis is given in ex- 
ample (32). This minimal pair demonstrates the finding that the requirement of 
the pre-prefix on the head noun seems to be stronger for subjects and direct ob- 
jects than for indirect objects. My native speaker consultant judged (32b) only 
marginally worse than (32a). 
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(32) a.  Pre-prefix on indirect object head noun 
Va-pel-(e) a-ma-yembe ga-vili a-v-ana  mbe-va-li. 
2-give-IMP PPF-6-mango 6-two PPF-2-child all-2-all 
‘Give two mangos to all of the children’ 


b. ? No pre-prefix on indirect object head noun 
Va-pel-(e) a-ma-yembe ga-vili v-ana mbe-va-li. 
2-give-IMP PPF-6-mango 6-two 2-child all-2-all 


‘Give two mangos to all of the children’ 


Thus, the Kihehe evidence is consistent with Generalization 6 in that ‘all’ ap- 
pears to combine with a full DP. The second half of Generalization 6, “even when 
other quantifiers do not [attach to a full DP]” will be discussed in tandem with 
Matthewson’s Generalization 7. 


(33) Gen7: In some languages, distributive universals appear to combine di- 
rectly with NP, while other quantifiers do not. (Matthewson 2013: 36) 


As described above, unlike mbe-AGR2-li, the distributive universal quantifier 
kila may not combine with a noun + pre-prefix, and in fact, kila and the pre-prefix 
are in complementary distribution (see footnote 3). If the pre-prefix is a D which 
attaches to NPs to form DP arguments, and kila may never attach to a noun with 
a pre-prefix, then this suggests that kila combines with NPs. Thus, this general- 
ization applies to Kihehe. 

Finally, consider Matthewson’s Generalization 8: 


(34) Gen8: A secondary pattern is to distinguish distributive universal quan- 
tifiers from other universal quantifiers, but for the former to use some 
other strategy such as reduplication, affixation, or adverbial quantification 
(Matthewson 2013: 37) 


As discussed above, the data suggest that Kihehe uses reduplication to express 
distributivity. While the strategy of reduplication is in need of more exploration, 
a preliminary hypothesis is that Kihehe has one strategy of distributive quantifi- 
cation in which the quantifier kila, borrowed from Kiswahili, combines with an 
NP, and one strategy of distributive quantification which uses reduplication. 


6 Conclusion 


This paper describes general properties of the universal quantifiers mbe-AGR2-li 
‘all’ and kila ‘every’ in Kihehe. It then discusses the quantifiers in light of the 
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properties of collectivity and distributivity, and provides data that suggest that 
Kihehe has an additional strategy of reduplication for expressing distributive 
quantification. Ultimately, this paper argues that Kihehe attests patterns simi- 
lar to what has been observed for other Bantu languages: mbe-aGR2-li is com- 
patible with both collective and distributive contexts, while kila is distributive. 
Unlike English every, kila is ungrammatical with a collective reading. However, 
while distributive, kila is infelicitous in contexts in which a ‘one by one’ or ‘one 
after another’ reading is required, though the exact constraints on kila, and a 
more thorough investigation of the strategy of reduplication, are left for future 
research. Finally, this paper considers the Kihehe data in light of the typology 
of quantifiers crosslinguistically and demonstrates that the Kihehe data are com- 
patible with three of Matthewson’s (2013) crosslinguistic generalizations bearing 
on syntactic properties of universal quantifiers. 
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Abbreviations 


Bare numerals (e.g., 1, 2, 3) refer to noun class markers or agreement markers for 
that noun class. 


1SG first person singular DUR durative 

1PL first person plural FV final vowel 
ADD proximal to addressee HAB habitual 
ADJ adjective IMP imperative 
AGR2 agreement marker 2 IMPF  imperfective 
ASSOC associative marker LOC locative 
AUX ` auxiliary N noun 

DEM demonstrative P1 recent past 
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P2 hodiernal past PRS present 

P3 remote past PST past 

PPF pre-prefix REL relative 
POSS possessive # infelicitous 


PROX proximal 
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Chapter 14 


A closer look at the Akan determiner bi: 
An epistemic indefinite analysis 


Augustina Pokua Owusu 
Rutgers University 


This study aims to shed light on the epistemic indefinite interpretation (EI) of the 
Akan (Asante Twi) determiner bi which hitherto had not been discussed in the 
Akan literature. In previous studies, Amfo (2010) and Arkoh (2011) review its ref- 
erential or specific indefinite interpretation. The current study shows that in addi- 
tion to the above interpretation when bi is used, the speaker signals that she does 
not have access to all the information about who or what satisfies the existential 
claim they are making. I employ Aloni (2001) and Aloni & Port's (2015) theory of 
conceptual covers and methods of identification to determine "knowledge" of a 
referent in a particular context. Conceptual covers are sets of individual concepts 
which exclusively and exhaustively covers the domain of individuals (Aloni 2001). 
I show that the epistemic indefinite analysis and the specificity or referential anal- 
ysis are compatible. When bi is used, the speaker asserts she can name a notewor- 
thy or identifying property about the referent of the NP, which is the referential or 
specificity interpretation. Additionally, she presupposes that she is ignorant about 
further characterizing information about the referent, the epistemic indefinite in- 
terpretation. 


1 Introduction 


The study of epistemic indefinites has become popular as the interest in non- 
verbal modality rises (see Aloni & Port 2015; Alonso-Ovalle & Menéndez-Benito 
2003 a.o. for studies on epistemic indefinites). Epistemic determiners or pro- 
nouns, as their name suggests, signal the epistemic state of a speaker, i.e., whether 
the speaker knows the referent of an NP or not. The study of epistemic indefinites 
is usually divorced from the study of specific indefinites or the wide scope inter- 
pretation of the indefinite for two main reasons. First, in most ofthe well-studied 
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languages, the specific indefinite determiner and the epistemic indefinite deter- 
miner are expressed by different morphemes. For instance, in English, specificity 
is encoded by either a wide scope reading of the determiner a or the morpheme 
certain, while epistemic indefiniteness is encoded by the quantifiers some or some 
or other. In German, specificity is encoded by determiners bestimmt and gewiss 
while the epistemic indefinite determiner is irgendein. Secondly, these markers 
appear to express opposing concepts. Specificity requires that the speaker have a 
particular referent in mind; these markers are often used referentially, i.e., there 
is a particular referent that can be identified by the speaker or some other salient 
individual. Epistemic indefinites, on the other hand, require that the speaker be 
“ignorant” about the referent of the indefinite. These indefinites are, however, 
not mutually exclusive. For instance, the German epistemic indefinite irgedein, 
and specific indefinite bestimmt, are felicitous in the same sentence (Aloni & Port 
2015). 

This paper aims to show that these two types of indefinites can be encoded 
in the same lexical item in a language. The Akan determiner bi, which has been 
argued to have a specific indefinite use (Amfo 2010; Arkoh 2011) also has an epis- 
temic indefinite interpretation. The specificity interpretation requires that the 
speaker have a mental representation of a particular individual which he can 
characterize by a noteworthy or identifying property. The epistemic indefinite 
interpretation is a presupposition of ignorance about further characterizing in- 
formation about the referent. These pieces of missing information — what the 
speaker does not know - are the critical properties that determine whether the 
speaker "knows" the referent. Context determines what a speaker has to know 
about a referent to claim he "knows" the referent. 


(1) Akan (Arkoh 2011: 37) 

a. Maame Ama ye-e edziban bi. 
woman Ama do-PAsT food IND 
"Madam Ama cooked (some specific) food’ 

b. Akan (personal knowledge) 
Maame Ama ye-e aduane bi nansome-n- nim aduane kro. 
woman Ama do-PAsT food  iNpbut  1SG.-NEG- know food one 
"Madam Ama cooked (some specific) food, but I don't know what 
food it is? 


The food in (la) is specific; i.e., there is a particular food being cooked by 
Maame Ama that the speaker has in mind. Bi is felicitous since the fundamen- 
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tal way to identify food is by its name, and the speaker does not know this. In 
other words, the ignorance presupposition is satisfied. The fact that the speaker 
does not know the name is stated explicitly in (1b). Adopting Aloni & Port (2015) 
methods of identification and conceptual covers, I account for what counts as 
“knowing” in a context. 

The paper is structured as follows: In §2, I review the specific indefinite analy- 
sis of bi, paying attention to its scope taking properties and its felicity conditions. 
In §3, I discuss epistemic indefinites, their functions and the methods of identifi- 
cations. §4 is the analysis and §5 the conclusion. 


2 The specificity analysis 


The specificity or referential interpretation of bi has been discussed by Amfo 
(2010) and Arkoh (2011). Amfo (2010) argues that bi is an existential quantifier 
that has the cognitive status of referential. The determiner bi is used to intro- 
duce new referents into the discourse. In the spirit of Heim’s (1983) File Change 
semantics metaphor, when bi is used a new file for the NP is created. An ad- 
dressee constructs a referent for the NP by identifying properties that exemplify 
the head noun in question. Arkoh (2011), on the other hand, argues that when bi 
is used as a determiner, it is interpreted as referential along the lines of Fodor 
& Sag (1982) and Kratzer (1998). She argues against Amfo’s (2006) claim that it 
is an existential quantifier. She claims that bi does not have a quantificational 
interpretation. Amfo (2010) and Arkoh (2011) make different predictions about 
the scope of bi. A referential determiner always has wide scope while a specific 
existential indefinite can get an intermediate scope. For Arkoh (2011), one piece 
of evidence against the existential quantifier analysis is the fact that bi+NP can 
make a discourse referent that can be referred to as in (2b). The bare NP, which 
is interpreted as an existential quantifier does not have this property, as a result 
(3b) is infelicitous. 


(2) Akan (Arkoh 2011: 35) 
a. Kwame hwe-e abofra (tuntum) bi. 
Kwame cane-PAsT child black REF 
“Kwame caned a certain (dark) child’ 


b. 9-ye bubuafo. 
3sG.SUBJ-be cripple 


‘S/he is a cripple: 
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(3) Akan (Arkoh 2011: 35) 


a. Kwame hwe-e abofra. 
Kwame cane-PAsT child 


‘Kwame caned a child: 


b. #9-ve bubua-fo. 
3SG.SUBJ-be cripple-NoM 


‘S/he is a cripple: 


The English indefinite determiner a is ambiguous between quantificational 
and referential readings as shown below. 


(4) English (Abusch & Rooth 1997) 
John overheard the rumor that a student of mine had been called before 
the dean. 


a. ‘John overheard the rumor that a particular student of mine namely 


Bill has been called before the dean? 


b. John overheard the rumor that some student of mine has been called 
before the dean: 


In (4a), the indefinite is interpreted as referring to a specific student, one that 
John is aware of. In (4b), on the other hand, John does not appear to know the 
referent of the indefinite. In English, the referential and quantificational interpre- 
tation results from a wide scope or narrow scope reading of the indefinite. The 
aim of this section is not to choose between these two analyses, but to show that 
a referential or specificity interpretation of bi has been explored. What is essen- 
tial for this study is that both analyses agree that when a speaker uses N+bi, he 
intends to refer to a particular referent which he has in mind. 

Even as a referential determiner, bi does not have the same interpretation or 
distribution as the referential definite determiner. Heim (1983) argues that spe- 
cific indefinite, like other indefinites, can introduce a new file. A definite, on the 
other hand, can only be used when updating an existing file; you cannot intro- 
duce a new discourse referent with a definite marker. You can begin narratives 
with a specific indefinite, but not a definite determiner. Another difference is 
that a definite expression is used when a speaker presupposes that the referent 
of the expression is also accessible to the hearer. That is, the speaker assumes 
that there is a unique referent that the hearer can identify. Either (i) because the 
referent was previously mentioned in the context of discourse, or (ii) because the 
referent is part of the interlocutors' shared knowledge, or (iii) because there is 
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enough descriptive content in the sentence to identify the referent. The referent 
becomes identifiable as the sentence is processed (Comrie 1989: 135; Givon 2001: 
450; Gundel et al. 1993: 277; Hawkins 1978: 167-168; Payne 1997: 263). For specific 
indefinites, specificity lies in the fact that the speaker has a particular referent in 
mind. The addressee is just expected to be able to form a representation of this 
referent provided there are enough clues in the utterance itself and an accessible 
context. The speaker does not assume that the listener/addressee knows the ref- 
erent of the NP. The difference between bi and the definite determiner no is that 
of assumed addressee ignorance. When a referent is familiar to both addressee 
and speaker, the definite determiner is used. This is the difference between (5a) 
and (5b). 


(5) Akan (Amfo 2010) 


a. Kwame dze edzibanno maa Ama. 
Kwame take food Fam give-PAsT Ama 


‘Kwame gave the food to Ama’ 


b. Kwame dze edziban bi maa Ama. 
Kwame take food ` IND give-PAST Ama 


‘Kwame gave a certain food to Ama. 


In (5a), the food that Kwame gave to Ama is discourse old; the referent is fa- 
miliar to both the speaker and the addressee. This familiarity may be due to one 
of the reasons mentioned above that makes a referent of NP accessible to an ad- 
dressee. The referent of food in (5b) is, however, discourse new and only familiar 
to the speaker. 


2.1 Scoping-taking properties of bi 


One of the unique and uncontroversial characteristics of specific indefinites is 
their ability to escape scope islands. Scope islands are syntactic configurations 
which disallow wide scope for most quantifiers; these include relative clauses 
and antecedents of conditionals (see Fodor & Sag 1982, and much subsequent 
literature). In addition to taking wide scope, indefinites have been observed to 
take an intermediate scope, outside of the scope island but underneath a higher 
quantifier. In this section, I explore the scope properties of bi with intensional 
predicates, in the context of negation, with other nominal quantifiers, and in the 
antecedent of a conditional. We begin with intensional predicates. 

When bi is embedded under intensional predicates, it always receives a wide 
scope reading. It scopes over the intensional predicate. In this way bi is similar 
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to the German specificity markers bestimmt and gewiss (Ebert & Hinterwimmer 
2012). 


(6) Akan (Personal knowledge) 


a. Kofi re- hwehwe CD bi. 
Kofi -PROG- search CD IND 
‘Kofi is looking for a certain CD’ 
ay [CD(y) ^ search(K,y)]] 

b. Kofi re- hwehwe CD. 
Kofi -PROG- search CD 


‘Kofi is looking for a CD: 


In (6a), Kofi is not going to be happy when he finds just any CD; he will only 
be happy if he finds his Thriller CD. There is no such restriction in (6b), finding 
any CD will make Kofi happy. 

Biis infelicitous in a negative context. Neither a wide scope nor narrow scope 
interpretation is available in (7a). This sentence can only be saved when the in- 
definite determiner is replaced by the NPI biara ‘any’ as in (7b).! 


(7) Akan (Personal knowledge) 
a. "Kofin- hwehwe CD bi. 
Kofi NEG- search CD IND 
b. Kofin- hwehwe CD biara. 
Kofi NEG- search CD any 
‘Kofi is not looking for any CD’ 


Some specific indefinites like the German bestimmt can in principle scope un- 
der negation. When it is licensed under negation, both the wide scope and the 
narrow scope interpretations are technically possible, though speakers disprefer 
the wide scope interpretation. Gewiss, on the other hand, is not licensed under 
negation. 


"The NPI is derived from a combination of the indefinite determiner bi and the emphatic particle 
ara. Biara like any has a free-choice interpretation that is licensed in positive sentences. 


(i) Akan (Personal knowledge) 
Kofi be- gye CD biara. 
Kofi FUT- take CD any 


‘Kofi will take any CD. 
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The indefinite determiner also interacts with other nominal quantifiers like 
the universal quantifier. 


(8) Akan (Personal knowledge) 


a. Obiara hyia -a presidential candidate bi. 
Everyone meet -PsT presidential candidate IND 


'Everyone met a presidential candidate: 


b. Ye- ko-e no obiara to -ə nwomabi. 
3PL- go -PST CFM everyone buy -PST book IND 


"When we went, everyone bought a certain book’ 


(9) a. dyvx[presidential candidate(x)[—]met(y,x)] 
b. vxay[presidential candidate(x)[—]met(y,x)] 


(8a) is ambiguous between a wide scope and a narrow scope reading, just like 
certain in English (see Farkas 2002: ex. 54). The wide scope interpretation is that 
there is a particular presidential candidate, for instance, Hillary Clinton, such 
that everyone met her. The narrow scope reading expresses that for everyone 
there is a unique presidential candidate that they met, Kofi met Trump, Ama 
met Hillary, and Kwame met Bernie. In this way, bi is similar to the German 
specificity marker bestimmt, which is also ambiguous between wide and narrow 
scope interpretations with nominal quantifiers. It, however, differs from the other 
specificity marker gewiss which only has a wide scope reading with nominal 
quantifiers. 

Indefinites scope outside of conditionals despite the fact that conditionals con- 
stitute scope islands for other quantifiers (cf. Fodor & Sag 1982; Endriss 2009 
and the references cited therein). The example taken from Farkas (2002) has two 
readings. 


(10) German (Farkas 2002) 
Wenn Ben ein Problem von der Liste löst, wird Mr. Koens ihn loben. 
If Ben a problem from the list solves will Mr. Koens him praise 


‘If Ben solves a problem from the list, Mr. Koens will praise him: 


First, there is a narrow scope reading for the indefinite that says that Mr. Koens 
will praise Ben if he solves some problem or other from the list; any question 
that he answers will earn him praise. But there is also an exceptional wide-scope 
reading where the indefinite takes scope over the conditional, stating that there 


263 


Augustina Pokua Owusu 


is some specific problem on the list such that Mr. Koens will praise Ben if he 
solves that problem. 

In Akan, only the wide scope meaning is available in this context. The narrow 
scope interpretation is not available; it is only possible when the indefinite is 
replaced by the free choice item biara. 


(11) Akan (Suggested by reviewer) 
Se Kofi tumi bua  nsem bi ano wo nsohwe no mu 
if Kofi be.able answer questions IND mouth be.located exam DEF in 
a me- kye no adee. 
REL 1SG.FUT- give 35G.OBj thing 


‘If Kofi answers some questions on the test/in the exam, I will give him a 


gift. 


(11) only has the reading that there is a particular question such that answering 
that question earns Kofi a gift. 


2.2 Felicity conditions of indefinites 


Ionin & Matushansky (2006) proposes that specific indefinites carry felicity con- 
ditions on their use: a specific indefinite can be felicitously used by the speaker 
only when particular pragmatic conditions have been met. She discusses two 
felicity conditions: noteworthiness and identifiability (cf. Abusch & Rooth 1997; 
Farkas 2002). Ionin (2013) argues that the indefinite this in English carries a con- 
dition of noteworthiness and odin in Russian carries a condition of identifiability. 
In this section, following Ionin (2013), I argue that bi carries both a noteworthiness 
and an identifiability condition. Ionin & Matushansky (2006) proposes that the 
use of this implies that the speaker knows something noteworthy about the ref- 
erent of the indefinite. The condition of noteworthiness is not the same thing as 
speaker knowledge; the speaker can felicitously use a this-indefinite even if she 
does not know the exact identity of the individual under discussion (what counts 
as knowing the identity of the referent will be discussed in the next section). This 
is illustrated in the examples below. 


(12) English (Ionin & Matushansky 2006: 183) 


a. # Mary wants to see this new movie; I don't know which movie it is. 


b. Mary wants to see this new movie; I don't know which movie it is, 
but she's been all excited about seeing it for weeks now. 
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c. I want to see this new movie - I can't remember its name and I have 
no idea what it’s about, but someone mentioned to me that it’s 
really interesting. 


In all the examples above, the speaker states that she does not know the ref- 
erent of the indefinite, but (12b) and (12c) are felicitous because the referent is 
noteworthy. (12a) is infelicitous because the condition of noteworthiness is not 
met. Noteworthiness must be expressed in the sentence. 

Abusch & Rooth (1997) propose that the felicitous use of “a certain X” requires 
the speaker to be able to answer the question “which X is it?” (see also Ebert et al. 
(2012) for a similar proposal for gewiss and bestimmt). Aloni (2001) argues that in 
Russian when a speaker utters odna kniga ‘one book’, the speaker conveys that 
she can answer the question “which book is it?”; the response to this question 
names an identifying property that singles out a specific book, distinguishing 
it from all other books. The identifying property does not have to be the name 
of the book; it may just as easily be some other relevant property that singles 
out a specific book. More importantly, the identifying property must come from 
outside of the sentence. 

The indefinite determiner bi has both a noteworthy and an identifiability felic- 
ity condition, but only one of these conditions needs to be satisfied for its felici- 
tous use in a context. The difference between bi and the English determiner this, 
which only has a noteworthy felicity condition, is shown in (13a). In (13a), the 
sentence is grammatical even though the noteworthy condition is not satisfied, 
while in (12a), the lack of noteworthiness makes the sentence infelicitous. (13a) 
is felicitous in this context because there is an identifiable property ‘new movie’. 
As stated above, identifiability does not only have to do with naming (see Aloni 
& Port (2015) on conceptual covers and methods of identification for epistemic 
indefinites) but any description that is able to set the referent of an NP apart 
from other NPs. Identifiability is context dependent. Context determines what 
counts as an identifiable property of an NP is in order to assume that the speaker 
"knows" it. Context and how it relates to identifiability will be discussed in detail 
in $4. In (13a) for instance, ‘new movie’ is an identifiable property that separates 
the movie Ama wants to watch from other movies. But there can be countless 
new movies at any particular time; it appears this identifying property does not 
qualify as enough information to say that we know the movie in question. 
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(13) Akan (personal knowledge) 


a. Amape se ə- ko-hwe sini foforobi a a- ba. 
Ama want COMP 3SG- MOT-watch movie new IND REL PERF- come 
Me-n- nim sini koro mpo. 
1SG- NEG- know movie one even 
‘Ama wants to see a certain new movie. I don’t even know what 
movie. 

b. Amape se ə- ko-hwe sini foforobi a a- ba, 
Ama want COMP 3SG- MOT-watch movie new IND REL PERF- come, 
me-n- nim sini koro nansoə- a- ka ho asem saa 
1SG- NEG- know movie one but — 3sG- PERF- say self message EMP 
ara. 

EMP 
‘Ama wants to see a certain new movie, I don’t even know what 
movie but she has been talking about it for two weeks: 

c. Me-pe se me- ko-hwe sini fofora bi. Me- n- 
1SG- want COMP 3SG- MOT-watch movie new IND 1SG- NEG- 
kae ne din, me-n- nim nea e- fa ho mpo 
remember 3sG-Poss name 1sG- NEG- know what 3sc- take self even 
nanso obi a- ka a- kyere me se g- ye kama. 
but someone PERF- say CONS- show 1SG.OBJ COMP 3SG- COP nice 
‘I want to see a certain new movie - I can’t remember its name and I 
have no idea what it’s about, but someone mentioned to me that it’s 
really interesting. 


We will now turn to the epistemic indefinite analysis of bi, keeping in mind 
the felicity conditions just discussed. 


3 Epistemic indefinites (EI) 


Aloni & Port (2015) distinguish between two types of indefinites: plain indefinites 
and epistemic indefinites. Plain indefinites like somebody, in addition to their 
conventional meaning, have an ignorance implicature. 


(14) English (Aloni & Port 2015: 117) 
Somebody arrived late. 


a. Conventional meaning: Somebody arrived late. 


b. Ignorance implicature: The speaker doesn’t know who. 
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Epistemic indefinites, on the other hand, are indefinites in which this igno- 
rance inference is conventionalized, i.e., is part of the meaning of the indefinite. 
Epistemic indefinites express the knowledge state of the speaker. Examples of 
epistemic indefinite determiners include German irgendein (Haspelmath 1997; 
Kratzer et al. 2002, cited in Aloni & Port 2015) and Italian un qualche (Zamparelli 
2008, cited in Aloni & Port 2015). The examples below are from Aloni & Port 
(2015). 


(15) German (Aloni & Port 2015: 119) 
Irgendein student hat angerufen. #Rat mal wer? 
Irgend-one student has called guess PRT who . 
Conventional meaning: ‘Some student called’ - the speaker doesn't know 
who. 


(16) Italian (Aloni & Port 2015: 119) 
Maria ha sposato un qualche professore. #Indovina chi? 
Maria has married a qualche professor guess who? . 


Conventional meaning: ‘Maria married some professor’ - the speaker 
doesn’t know who. 


In addition to expressing an existential proposition, these sentences have the 
additional claim that the speaker doesn’t know who the witness to this proposi- 
tion is (Aloni & Port (2015). For this reason, the continuation guess who? results in 
a contradiction. Guess who? presupposes that the speaker has some knowledge, 
which contradicts the ignorance inference of epistemic indefinites, resulting in 
the oddity. This assumed ignorance is not necessarily total ignorance of the ref- 
erent of the NP, just the contextual relevant property to claim knowledge of the 
referent. Plain indefinites, on the other hand, allow for this type of continuation. 


(17) English (Aloni & Port 2015:117) 
Somebody arrived late, guess who? 


As an epistemic indefinite, therefore, bi should behave like irgendein and un 
qualche. We expect that it is infelicitous with Guess who?, when it expresses that 
the speaker is ignorant about the contextually relevant property to characterize 
the NP that asserts knowledge. 

(18) Akan (personal knowledge) 


a. Sukuunibi a- fre wo. #wo hwea e- ye hwan? 
student IND PERF- call 2sG-oBy 2SG- look REL 3sG- cop. who 


‘Some student has called, guess who?’ 
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b. Amaa- ware professor bi # wo- hwe à e- ye hwan? 
Ama PERF- marry professor IND 2sG- look REL. 38G- coP who 


‘Ama has married some professor, guess who?’ 


I have to point out however that (18a) is felicitous when the speaker is sarcas- 
tic, but this context is marked. Bi, therefore, appears to have a conventionalized 
ignorance inference. In addition to guess who, epistemic indefinites are also infe- 
licitous with namely. It becomes felicitous, however, if the speaker signals that 
he is reporting the name and does not know anything else about the referent. 


(19) Akan (personal knowledge) 


a. Sukuunibi fre -e wo & Yes- fre no Kwadwo 
student IND call -PST 25G IMP- call 3sG-05J Kwadwo? 


"Some student called you, # he is called Kwadwo. 


b. Sukuunibi fre -e  wo.o- se ys- fre no Kwadwo. 
student IND call -PST 2sG 3sG- say IMP- call 356-08] Kwadwo 


‘Some student called you, he says he is called Kwadwo. 


In the subsequent sections, I will discuss bi in relation to the functions of epis- 
temic indefinite discussed by Aloni & Port (2015). 


3.1 Functions of epistemic indefinites 


Aloni & Port (2015) discuss four functions of epistemic indefinites. These are 
(i) specific unknown function (SU), when it is used in an unembedded context, 
(ii) epistemic unknown function (epiU), when it is embedded under an epistemic 
modal, (iii) negative polarity item (NPI) function, when it gets narrow scope under 
negation, and (iv) deontic free choice function (deoFC), when it is embedded un- 
der deontic modals. They argue that for an indefinite to qualify for any of these 
functions, it must (a) be grammatical in the context the function specifies and (b) 
have the meaning that the function specifies. As I already discussed in the previ- 
ous section, bi cannot be embedded under negation, which means that it does not 
have an NPI function. Also, though bi can be embedded under a deontic modal, 
it does not have a deoFc function; it only has a specific unknown interpretation 
under deontic modals. 


(20) Akan (personal knowledge) 
e- se se Ama ware professor bi. 
3sG- have comp Ama marry professor IND 


‘Ama must marry some professor: (SU) 
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(20) only has the interpretation that there is a particular professor that Ama must 
marry. It does not have the meaning that it is necessarily the case that Ama 
marries some specific professor, which is the low scope reading of bi. 


3.1.1 Specific unknown function (SU) 


Syntactically, the specific unknown function is characterized by an unembedded 
use of the indefinite, i.e., use in matrix clause and not embedded under negation, 
modals or attitude verbs. Semantically, they have an obligatory ignorance effect: 
the speaker does not know the intended referent of the indefinite. Following 
Aloni & Port (2015), I will use the following continuation to distinguish between 
the specific and non-specific uses of the indefinite. 


(21) English (Aloni & Port 2015:118) 
John wants to marry a Norwegian 


a. She lives in Oslo and is 25 years old. 
b. One with blond hair and blue eyes. 


Bi like both irgendein and un qualche is grammatical in the specific unknown 
context and the interpretation specified. 


(22) Akan (personal knowledge) 


a. Amaa- ware professor bi. 4 wo- hwe a e- ye hwan? 
Ama PERF- marry professor IND 2SG- look COND. 3s6- cop who 


‘Ama has married some professor, #guess who?’ 


b. Sukuunibi fre -e wo. #wo hwea e- ye hwan? 
student IND call -PST 2SG-0BJ 2sG- look COND 3sG- cop who 


‘Some student has called, #guess who?! 


Like irgendein and un qualche, bi has the weaker modal variation interpreta- 
tion and not the stronger free choice interpretation. Aloni & Port (2015), follow- 
ing Alonso-Ovalle & Menéndez-Benito (2010), distinguish between two types of 
modal inference: modal variation and free choice. For the modal variation in- 
terpretation, more than one (but not necessarily all) alternatives in the relevant 
domain qualify as possible options. For the free choice interpretation, all the al- 
ternatives in the relevant domain qualify. In (23), the sentence is still true even if 
there are professors that the speaker has enough evidence to eliminate from the 
list of possible professors that Ama could have married. 
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(23) Akan (personal knowledge) 
Amaa- ware professor bi Me-n- nim nipa koro nanso 
Ama PERF- marry professor IND 15G- NEG- know human person but 
me- ye surese g- n- ye Kofi 
1sG- do sure COMP 3SG- NEG- COP. Kofi 


‘Ama has married some professor. I don’t know who it is. I am sure it is 
not Kofi? 


3.1.2 Epistemic unknown function (epiU) 


An ignorance effect similar to the one with specific unknowns arises when bi is 
embedded under epistemic modals. 


(24) Akan (personal knowledge) 
e- be- tumi a- ba se | Amaa- ware professor bi. 
3SG- MOD- be.able CONS- come coMP Ama PERF- marry professor IND 


‘It could be that Ama has married some professor: 


This also has the weaker modal variation interpretation. It is compatible with 
the hide and seek scenario described in Aloni & Port (2015), where not all the 
alternatives in the relevant domain qualify. 


(25) Akan (personal knowledge) 
e- be- tumi a- ba se  Amaa- ware professor bi. e- 
3SG MOD- be.able cons- come comp Ama PERF- marry professor IND 3SG- 
te saa a me- ye surese e- n- ye Kofi. 
COP DEM COND 15G- do sure COMP 3SG- NEG- COP Kofi 


‘It could be that Ama has married some professor. If that is true, I am sure 
it is not Kofi: 


Similar to irgendein and un qualche, bi embedded under propositional attitude 
verbs have agent oriented ignorance effects. 


(26) Akan (personal knowledge) 
Nana gye di se Amaa- ware professor bi. 
Nana collect eat comp Ama PERF- marry professor IND 
‘Nana believes Ama has married some professor. 
‘Nana believes that Ama married some professor, I don't know who: (SU) 


‘Nana believes that Ama married some professor, Nana don't know who’ 
(EpiU) 
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Bi therefore is more similar to un qualche which has no NPI and deoFC func- 
tions. Table 1, taken from Aloni & Port (2015), shows some cross-linguistic com- 
parison of epistemic indefinites; I have added bi to this table. 


Table 1: Cross-linguistic comparison of epistemic indefinites 


SU epiU NPI deoFC 
irgendein yes yes yes yes 
alg'un (SP) yes yes yes no 
un qualche yes yes yes no 
-si (Cz) yes no no no 
vreun (Rom) no yes yes no 
any(En) no no yes yes 
qualunque (It) no no no yes 
bi (Akan) yes yes no no 


Akan confirms Aloni & Port’s (2015) hypothesis that there should be no lan- 
guage where an epistemic indefinite has a deoFC function but not an NPI func- 
tion. 


3.2 Methods of identification and conceptual covers 


There are at least two ways in which a context can determine a quantificational 
domain, domain widening and method of identification (conceptual covers). A 
conceptual cover is a set of individual concepts which exclusively and exhaus- 
tively covers the domain of individuals (Aloni 2001). 


(27) [Definition of CONCEPTUAL COVERS] Given a set of possible worlds W 
and a domain of individuals D, a conceptual cover CC based on (W, D) is 
a set of individual concepts [i.e., functions W — D] such that: 


Vw€W :vdeD:atceCC :c(w)-d 


She explains this with a card scenario which I repeat below. In front of you lie 
two face-down cards, one is the Ace of Hearts, the other is the Ace of Spades. You 
know that the winning card is the Ace of Hearts, but you don't know whether 
it's the card on the left or the one on the right. Now consider (28): 


(28) English (Aloni & Port (2015)) 
You know which card is the winning card. 
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Based on the scenario above, sentence (28) could be true or false in the described 
scenario. Intuitively, there are two different ways in which the cards can be iden- 
tified here: by their position (the card on the left, the card on the right) or by 
their suit (the Ace of Hearts, the Ace of Spades). Our evaluation of (28) seems 
to depend on which of these identification methods is adopted. In the seman- 
tics of knowing-wh constructions proposed in Aloni (2001), the evaluation of (28) 
depends on which of these covers is adopted. She adds that this dependency is 
captured by letting the wh-phrase range over concepts in a conceptual cover in- 
stead of plain indefinites. Cover indices n are added to their logical form, and 
context supplies their value. 


(29) English (Aloni & Port 2015) 
You know which-n card is the winning card. 
False if n —> on-the-left, on-the-right 
True if n — ace-of-spades, ace-of-hearts 
Trivial if n —> the-winning-card, the-losing-card 


Conceptual covers and methods of identification are essential in understand- 
ing especially the specific unknown function of epistemic indefinites. When a 
speaker uses a specific marker, she signals that she has a particular referent in 
mind and that she can identify the referent of the indefinite. This appears to con- 
flict with the ignorance inference that I have argued that epistemic indefinites 
have. The natural way to resolve this conflict is to assume that there are two 
methods of identification at play, the speaker knows one but not the other. The 
ignorance is not about all methods of identification for the referent, but for the 
essential one in that particular context. 


3.2.1 Methods of identification 


In this section, I explore the different methods of identification and the context 
of use that license bi. I compare bi to the German irgendein and the Italian un 
qualche. The method of identification that will be discussed are naming, ostension 
and description. 
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3.2.1.1 Description and Naming 


Scenario: You are visiting a foreign university and you want to meet some pro- 
fessor. 


(30) Akan (personal knowledge) 
Me- re hwehwe professor bi, onona ə- ye head of 
1SG- PROG- search professor IND, 3sG FOC. 3SG- COP. head of 
department, me- n- nim ne din. 
department, 1sG- NEG- know 3sG-Poss name. 
‘I am looking for some professor, he is the head of department but I don't 
know his name: 
Speaker-can-identify — [Description], unknown — [Naming] 


In this scenario, the method of identification contextually required for knowledge 
is naming, but the referent of the epistemic definite can only be identified by 
description. 


3.2.1.2 Naming and Ostension 


Scenario: At a conference, you have to meet a famous linguist. 


(31) Akan (personal knowledge) 
g- wo se  me-hyia professor bi, ye- fre no Nana Aba nanso 
3sG- have comp 1sG- meet professor IND, 3sG call her Nana Aba, but 
me-n- nim no. 
1SG- NEG- know 3sG 
‘I have to meet some professor, her name is Nana Aba, but I don’t know 
her. 


Speaker-can-identify — [Naming], unknown — [Ostension] 


In this scenario, the method of identification contextually required for knowledge 
is ostension, but the referent of the epistemic definite can only be identified by 
naming. 

3.2.1.3 Ostension and Naming 


Scenario: You are watching a football match and a player gets injured, so you tell 
your friends: 
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(32) Akan (personal knowledge) 
Hwe player bi a- pira, ye fre no sen? 
look player IND PERF- be.injured, 3PL call 3sG.oBy what 
‘Look, some player is injured, what is his name?’ 
Speaker-can-identify — [Ostention], unknown — [Naming] 


In this scenario, the method of identification contextually required for knowledge 
is naming, but the referent of the epistemic definite can only be identified by 
ostension. 

Aloni (2001) ranks the method of identification as indicated in (33): 


(33) Ostension > Naming > Description 


Like bi, irgendein is felicitous in all the scenarios presented, un qualche on the 
other hand is infelicitous in the third scenario when the speaker could identify 
by ostension, but naming was unknown. Bi behaves like Germanic epistemic 
indefinites. 


4 Analysis 


Following Ionin (2013), I propose the following as the semantics of bi. 


4.1 Semantics 


A sentence of the form [bi a] P expresses a proposition only in those utterance 
contexts c where the speaker intends to refer to exactly one individual or the 
max of a group? y which is [a] in c and the relevant felicity conditions in (34a) 
or (34b) are fulfilled. Then [bi æ] f is true at an index y if y is f at y and false oth- 
erwise. In addition to the felicity conditions, there is a presupposition that states 
the noteworthy and/or identifying property is all the information the speaker 
has about [bi æ]. This presupposition is what differentiates bi and the English 
some from English the specific indefinite a. These sentences are only grammati- 
cal when they fulfill either of the conditions below. 


(34) a. For [bi a] f, the speaker has in mind a noteworthy property 
9 € Dis, et) such that g(wc)(y) = 1. 
b. The speaker is able to name an identifying property o € DKs, et such 
that g(we)(y) = 1 and Vz[a(wc)(z) = 1 and z 2 y] > g(we)(z) + 1] 
and o + a and ọ + f. 


? Bi is compatible with plural nouns. 
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c. Presupposition: Speaker is unable to provide any further information 
about who or what satisfies the existential claim s/he is making. 
Alonso-Ovalle & Menéndez-Benito (2003) for English some and 
Spanish algun. They did not, however, state it as a presupposition. 


This meaning rules out a kind or generic interpretation for bi. (35a) can only mean 
that there is exactly one dog that Kofi saw, it cannot have the interpretation that 
Kofi saw something of the kind dog. When a plural is used with the determiner 
as in (35b), the determiners quantifiers over the sum or the max of the referents. 


(35) Akan (personal knowledge) 


a. Kofi hu -u kraman bi. 
Kofi see -pst dog bi 


‘Kofi saw a certain dog’ 


b. Kofi hu -u n- kraman bi. 
Kofi see -PsT PL- dog bi 


‘Kofi saw some dogs’ 


What is considered noteworthy is provided by the sentence, and identifiability 
supplied by context. For an N+bi to be felicitous, the noteworthy or identifying 
felicity condition must be satisfied, i.e., the speaker needs to be able to name at 
least one defining property of the referent with which an addresser can construct 
a referent of their own. And also the presupposition must be satisfied, i.e., what- 
ever the noteworthy or identifying property is, its use is not sufficient to claim 
“knowledge” of the referent in the context. What qualifies as sufficient informa- 
tion to “know” the referent is regulated by the method of identification licensed 
by the context. In other words, in every context, there are two ways to identify 
the referent of an NP. One is whatever properties, descriptions or characteristics 
that are used in the sentence to identify it. This helps an addressee construct a 
referent of their own. Then, there is the way a particular context requires that 
the referent of an NP is identified in other to count as “knowing” the referent. 
For instance, if an NP is described as a “red flower” in a sentence, then the first 
method of identification is satisfied. But if the contexts require that someone only 
know what flower it is by knowing the name of the flower, then “red flower” is 
inadequate in this context. The use of N+bi signals that the second identifying 
property, i.e., the one required by context to “know” the referent is not satis- 
fied. This idea of multiple methods of identification is what conceptual covers by 
Aloni & Port (2015) appear to capture: 
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Suppose m is the conceptual cover representing the identification method 
contextually required for knowledge, then EI signal an obligatory shift to a 
cover n different from m. That is, they existentially quantify over a cover 
which represents a method of identification which is not the one at play in 
the relevant context. (Aloni & Port 2015: 132) 


If we take a look at (36), the speaker has asserted a noteworthy and identify- 
ing property about the flower. This property does not, however, show that the 
speaker "knows" the flower. What does it mean to know a flower? We use names 
to identify flowers. There are dozens of red flowers and thus knowing the color 
of a flower does not count as knowing it. 


(36) Akan (personal knowledge) 
Me- hu -u nhwiren kokoo bi. 
isG- see -PST flower red IND 


‘I saw a certain red flower’ 
Speaker-can-identify — [Description], unknown — [Naming] 


In this scenario, the method of identification contextually required for knowledge 
is naming, but the referent of the epistemic definite can only be identified by 
description. The ignorance component of the indefinite does not conflict with 
the noteworthy and identifying property requirement. 

In example (37), the book is identified as the book lying on the table, as against 
the one on the bookshelf. 


(37) Akan (personal knowledge) 
Me- hu -u nwomabi wo pono no so. 
isG- see -PST book  IND be.located table DEF top 


"There is a book on the table ` 


This sentence is (in)felicitous in different contexts. 


Context 1: I am sitting by the table and I can see War and peace on the table. 
Someone is looking for War and peace. 


(38) Akan (personal knowledge) 
# Nhoma bi daponono so nansome-n- nim nhoma koro. 
book IND lie table DEF top but 1sG- NEG- know book one 


"There is some book on the table but I don’t one which book it is? 
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In this context, the speaker knows the name of the book and can see it. 
What appears to count as knowing a book is knowing its name. Thus the 
identification supplied by the sentence and the identification required by 
the sentence are the same, there is no speaker ignorance. The speaker 
knows what is necessary and sufficient to identify the book in this con- 
text, the ignorance component of bi is not fulfilled, and so the sentence is 
infelicitous. 


Context 2: I am sitting at the table, and I can see War and peace on the table. 
Someone is looking for some book, but I do not know what book they are 
looking for. 


(39) Akan (personal knowledge) 
Nwomabi da  ponono so. 
book _ IND sleep table DEF top 


"There is some book book on the table’ 


In this context, the speaker knows the name of the book and can see it. 
What s/he does not know is if the referent of the N satisfies what the 
addressee is looking for. So here, knowing the name of the book is not 
sufficient, what counts as “knowing” in this context is knowing what the 
speaker is looking for. The identification supplied by the sentence and the 
identification required by the sentence is different, speaker ignorance is 
satisfied, and the sentence is felicitous. 


Context 3: I am cooking in the kitchen; someone is looking for War and peace. I 
remember I saw War and peace on the table. 


(40) Akan (personal knowledge) 
#Me-hu -u nwomabi wo pono no so nanso me- 
isG- see -PST book ND be.located table DEF top but — 1sG- 
n- nim nhoma koro. 
NEG- know book one 


"There is some book on the table but I don't one which book it is? 


In this context, the speaker knows the name of the book and she knows 
this satisfies the what the addressee is looking for. Knowing the name of 
the book is sufficient and counts as knowing what the speaker is looking 
for. The identification supplied by the sentence and the identification re- 
quired by the sentence is the same. Speaker ignorance is not satisfied, so 
the sentence is infelicitous. 
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Context 4: I am cooking in the kitchen; someone is looking for War and peace. I 
remember I saw a book on the table, but I do not remember exactly what 
book it was. 


(41) Akan (personal knowledge)) 
Me-hu -u nwomabi wo ponono so nanso me- n- 
isG- see -PST book IND be.located table DEF top but 1SG- NEG- 
nim nhoma koro. 
know book one 


"There is some book on the table but I don’t one which book it is: 


In this context, what counts as knowing is knowing what the addressee is 
looking for. The speaker does not know the name of the book. Therefore 
the identification supplied by the sentence (description) and the identifica- 
tion required by the sentence (naming) are different. Speaker ignorance is 
consequently satisfied, and the sentence is felicitous. 


5 Conclusion 


I have shown how that bi has both a specific indefinite and an epistemic indef- 
inite and these interpretations do not contradict each other. The identifiability 
felicity condition that is required for the interpretation of the specific indefinite 
feeds the epistemic indefinite interpretation. I have argued that when bi is used, 
the speaker signals that she does not have access to all the information that is 
required to "know" a referent in a particular context. She has some information 
about the referent to identify him, but not enough to "know" him. In so doing, I 
have highlighted a meaning of the indefinite determiner bi that has hitherto not 
been discussed in the Akan literature. 
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Abbreviations 

CEM  Clausal Final Marker MOT Verb of motion 
comp Complementizer NEG Negation 
cop Copular PERF Perfect 

DEF Definite Poss Possessive 
DEM Demonstrative PROG Progressive 
EMP Emphatic marker PRT Particle 
Fam Familiar PST Past 

FUT Fut Ref  Referential 
IND Indefinite REL  Relativizer 
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Chapter 15 


The interaction of -9-...-ile with 
aspectual classes in Nyamwezi 


Ponsiano Sawaka Kanijo 
University of Gothenburg 


This study investigates different readings of the construction -g-...-ile resulting 
from the interaction with aspectual classes in Nyamwezi, a Bantu language spoken 
in Tabora, Tanzania. The study is motivated by two observations. Firstly, in previ- 
ous analyses (Maganga & Schadeberg 1992), it was noted that -g-...-ile in Nyamwezi 
selects few verbs, but the patterns governing its distribution were not identified. 
Secondly, not much has been done on describing the semantics of verbal roots in 
Nyamwezi (and many other Bantu languages), which appears to be a source of vari- 
able temporal interpretations of -g-...-ile. Based on field data, I show that the first 
main function of -g-...-ile is that of stativizer. That is, -2-...-íle picks out a phase 
of an event or the entire event (if the event lacks phasic structure) and presents 
it as a stable, undifferentiated property. -g-...-ile does so in three different ways, 
which give three different readings: (i) resultative, when it occurs with achieve- 
ments, (ii) general present, which occurs with statives and (iii) a progressive-like 
reading, which occurs with some activity verbs classified as “directed motion verbs” 
in this study. The discussion of the contrast between these readings is informed by 
a progressive form -lrr-. Many other activity verbs do not typically occur with -g- 
…-ile. Those that occur with this construction either suggest the change-of-state 
or condition of the verb's subject (which has the semantic role of patient; patien- 
tive subject) or they are used in a special pragmatic condition where -g-...-ile is 
coerced. Coercion of -g-...-ile expresses a contradiction and/or emphasis, which 
can be regarded as a second main function of this construction. 


1 Introduction 


Sixty-six percent of Bantu languages use the verb suffix - ile (also -ire, -ite, -ide or 
-i) to refer to a variety of temporal notions, ranging from past tense, perfective 
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aspect to perfect/anterior (Nurse 2008; Botne 2010). In most of these languages, 
-ile occurs with the tense marker -a- at the pre-stem slot to refer predominantly 
to far or middle past. When -ile occurs without the tense marker -a-, it is used 
to express either past/perfective or perfect. Nyamwezi is one of these Bantu lan- 
guages in which -ile occurs both with and without the pre-stem TAM marker 
-a-. The suffix -íle with -4- (-á-...-íle) indicates past perfective, as exemplified in 
(1). The construction -4-...-ile in this language occurs with all types of verbs and 
maintains the past perfective reading. 


(1) waamialilé 
o-á-mal-íle 
3SG-CPL-finish-ile 
‘S/he finished (yesterday to infinity)’ 


Unlike many other Bantu languages in which the verb suffix - ile can occur with 
different types of verbs to encode different readings (see e.g. Brisard & Meeuwis 
2009), in Nyamwezi the suffix -ile, without the tense marker -á- (-g-...-ile) is re- 
stricted to a small set of verbs. For instance, it does not commonly occur with 
what is called dynamic verbs (those which denote a process, action or activity) 
such as ‘plant’, ‘write’, ‘drill’, etc. The same was also observed in a grammatical 
sketch of Nyamwezi by Maganga & Schadeberg (1992). Maganga & Schadeberg 
note that -g-...-ile selects few verbs, mostly change-of-state verbs, to indicate the 
newly entered state. However, due to the restricted purpose of their study, they 
did not go into detail to describe all types of verbs that occur with this construc- 
tion and to determine the readings when the verb occurs with -g-...-ile. As they 


say: 


Unfortunately, we have not been able to work out the semantic subclassifi- 
cation of verbs which would allow us to describe the function of this tense 
[sic]. Maganga & Schadeberg (1992: 126) 


This paper, therefore, is an attempt to provide a clearer understanding of the 
readings of -g-...-ile in Nyamwezi when interacting with different types of verbs 
(which will be referred to as aspectual classes). 

The reminder of this paper is organized as follows. 82 presents basic informa- 
tion about Nyamwezi, followed by a brief discussion of the relationship between 
-g-...-ile and other tenses/aspects. The description of aspectual classes is given in 
83. Following this section, $4 presents a discussion of the interaction between -9- 
…-ile and aspectual classes. The paper closes with a brief summary of the study 
and some recommendations for further studies in 65. 
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2 Background 


2.1 Nyamwezi language 


Nyamwezi is a Bantu language spoken in the central western part of Tanzania, in 
the area called Tabora. The language is spoken by 1,320,000 people of Tanzania, 
where seventy-three percent of the speakers are found in the Tabora area (Lewis 
et al. 2013). The language is not highly endangered, despite the inevitable spread 
of Swahili in Tanzania. It is still spoken by people of all ages (Lewis et al. 2013). 

According to Guthrie’s 1967 widely acknowledged classification system, 
Nyamwezi is classified as F22. There are four dialects of Nyamwezi according 
to Masele (2001). These include Kinyanyembe, Kidakama, Sigalaganza and Ki- 
konongo. The description in this paper is based on the Kidakama dialect, spoken 
in Uyui district. 


2.2 -o-...-ile and its relation to other tenses/aspects 


The suffix -ile, apart from this basic form, can also be realized as -il-w-e, -ije and 
-tile. Verb stems that end with a glide create many variants. In longer stems of the 
form CVCW the glide /w/ is infixed within the suffix -ile. The form -ije occurs 
in verb stems that end in the sequence Cy, whereby the consonant /l/ of -ile is 
made palatal. Monosyllabic verb stems select a variant of -ile with a long vowel. 
All forms are exemplified in Table 1. 


Table 1: Variants of -ile 


The form of the stem Stem Gloss -ile form 
CVCw stems -togwá ‘love’ -togilwe 
-chilwa ‘hate’ -chililwe 
Stems ending in Cy  -sefya ‘boil’ -sefije 
-sufiya ‘dilute’ -sufije 
Monosyllabic stems -faá ‘die’ -fiile 
-pyaá ‘be ripe’ -piile 


Nyamwezi has a very complex tone system. The language is characterized by a 
H tone shift, where the underlying H tone of the subject/object concord and that 
of stem tends to shift one mora further to the right. Each type of tense and/or 
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aspect has a distinctive tone rule. The tone of -g-...-ile, as described in Maganga & 
Schadeberg (1992: 126), is shown in (2). TC/SC stands for tone copy of the subject 
concord. In all examples in this paper, the first sentence indicates the surface 
structure and the second one shows the underlying structure. 


(2 The tone pattern of -g-...-ile 
sc - (oc) - vB - ile + Tc/sc 


Nyamwezi has a highly agglutinative verbal structure with many possible 
tense, aspect and mood (TAM) prefixes and suffixes. As shown in (3), the TAM- 
marking morphemes in the verbal structure occupy the positions ofthe formative 
(slot 4), the post-final (slot 8) and the final (slot 9). The suffix -ile occupies the 
final position. 


(3) Nyamwezi (Maganga & Schadeberg 1992: 97) 
The structure of verb forms 
1 2 3 4 5 6 7 8 9 10 
PreIn In PoIn Fo Fo2 PreR VB PreF F PoF 
ASS SCNEG TAMIT OC Root/EXT TAM TAM/FV PL 


Maganga & Schadeberg (1992) note that the suffix -ile without the pre-tense 
marker -á (-g-...-ile) typically indicates that a new state has been reached. This 
occurs most obviously in change-of-state verbs (such as -gaanda ‘become thin’), 
as in (4), where -g-...-ile has a resultative meaning. 


(4) agaandilé 
a-9-gaand-ile 
3sG-become thin-íle 


‘S/he is thin (because s/he has become thin)’ 


-g-...-ile, while maintaining this reading, can appear in the past, present and 
future tenses. In the past tense, the construction occurs with an auxiliary verb 
-lr ‘to be’ as in (5), and in the future tense, it occurs with an auxiliary -kofiií ‘to 
be' in (6). 


(5) waal áálaalílé aho tulnfitä ywiipodlu 
o-á-lt a-g-laal-ile aho  to-ln-frt-á mu-ipoólü 
3SG-CPL.REC-AUX 35G-sleep-ile when 2PL-PROG-pass-FV LOC-forest 


‘S/he was sleeping when we were crossing the forest: 
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(6) liindag' hádoóo, igolo gakofüi gáfoombílé 
lind-ag-a  hadoóo, igolo ga-kofsií ga-o-Doomb-íle 
wait-IMP-FV short while tomorrow cl.6-Aux cl.6-soak-ile 


"Wait a bit, tomorrow they will be soaked: 


In the present tense, the construction -g-...-ile appears alone without -lr or 
-küfii as shown in (7), where the speech time is now. 


(7) nation" aalaalilé, ^ km  akolugolaga mlyaango! 
ná-In-fón-a a-g-laal-ile, km  a-ko-lugol-ag-a m-lyaango! 
1SG-PROG-see-FV 35G-sleep-ile why 3SG-NEG-open-REC-FV cl.3-door 


‘I think (lit. see) s/he is asleep, why s/he doesn’t open the door!’ 


Since the discussion of aspectual classes is crucial in understanding the mean- 
ing and interpretation of -g-...-ile, the following section describes briefly aspec- 
tual classes in Nyamwezi. Note that aspectual classes described in this paper can 
be further differentiated. The discussion is limited to broad aspectual classes, but 
where necessary, minor classes will be highlighted. A further and detailed anal- 
ysis of aspectual classes of Nyamwezi is still in progress. 


3 Aspectual classes 


The discussion of aspectual classes in this section pertains not only to the cate- 
gorization of verbs themselves, but also to the nouns the verbs tend to associate 
with. Following Rothstein (2004: 1), the term aspectual classes (which is also re- 
ferred to as lexical aspect or Aktionsart) is used in this study to refer to the event- 
types denoted by verbal expressions. The most widely received aspectual classes 
of verbal expressions are those stipulated by Vendler (1957). This includes states, 
activities, achievements and accomplishments. These classes are usually defined 
on the basis of temporal properties such as telicity and dynamicity. Telicity dif- 
ferentiates aspectual classes that indicate an inherent endpoint (telic) — achieve- 
ments and accomplishments, and those which have the opposite pattern (atelic) - 
states and activities. Dynamicity differentiates aspectual classes that have com- 
plex internal structure - activities and accomplishments, and those which have 
homogeneous internal structure - states and achievements. 

A number of diagnostic tests have been developed to determine Vendler's as- 
pectual classes. However, as Bar-el (2015: 105) notes, many of these tests are 
not applicable universally. Filip (2012: 724), specifically, points out that not all 
the tests developed for English can be transferred to other languages, due to 
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language-specific properties. In Bantu languages, Kershner (2002), Botne (2008) 
and Persohn (2017a,b), based on tenets of radical selection theories, a number of 
theories which classify aspectual classes with respect to the boundaries between 
different phases of an event, develop a number of language specific tests. These 
tests reveal more complex aspectual classes than those stipulated by Vendler. 
Before proceeding to the classification of aspectual classes in Nyamwezi using 
these Bantu specific tests, I will briefly outline a framework which has been used 
in many studies on aspectuality in Bantu languages, and also in this study, to 
classify aspectual classes. 


3.1 Phase representation 


One way in which the events named by the verbs may be differentiated is with 
respect to their temporal internal structures. In Bantu languages, Botne (1983), 
and Botne & Kershner (2000), following Freed (1979), propose a framework that 
takes into account three potential phases of an event, onset, nucleus and coda, to 
classify aspectual classes. The onset (O) phase, which may or may not be included 
in the event’s temporal structure, constitutes the initial or preparatory phase of 
the event. It is a preparatory stage for the nuclear activity of the event. The coda 
(C) phase constitutes a final phase which brings the event to a definite end or toa 
state resulting from the prior event. Lastly, the nuclear (N) phase is characterized 
as comprising the characteristic and prominent feature of the event. All three 
phases are schematically represented in Figure 1. 


onset | nucleus | coda 


Figure 1: Phase representation 


In this framework, verbs are classified into different aspectual classes with 
respect to a number of phases they encode (only N, all three phases, N and C/O, 
etc.), and based on whether particular phases are punctual or durative. Botne 
(2010: 32-33) categorizes two major aspectual classes based on this framework, 
actions and achievements. The major difference between these two classes is 
that achievements have a punctual nuclear phase (N) that denotes a change-of- 
state whereas actions do not. According to Botne, actions include Vendler’s (1957) 
activities and Smith’s (1991) semelfactives. Schematic representations of these 
aspectual classes are shown in Figure 2. 
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ENS l 
(O N N 


Activity Semelfactive 


Figure 2: Event structures for actions (Botne 2010: 33) 


Botne (2010) further classifies achievements into three subclasses which differ 
from one another in whether or not they incorporate an onset (O) and/or coda 
(C) phase. That is, inceptive achievements (such as die in English) encode a poten- 
tial onset phase leading to the change-of-state nucleus, transitional achievements 
(such as bend in English) encode both an onset and stative coda phase, while re- 
sultatives (such as English rupture) encode only the point-of-transition (N phase) 
and the stative coda. These subclasses are schematized in Figure 3. 


D == — C 
O N ONC N C 
Inceptive Transitional Resultative 


Figure 3: Event structures for achievements (Botne 2010: 33) 


Kershner (2002: 62) based on the same framework, classifies another impor- 
tant aspectual class, namely statives, which is not addressed in Botne (2010). Ac- 
cording to Kershner (2002), statives (in Chisukwa) represent an aspectual class 
without a phasic structure, thus she uses a dashed line to present the event struc- 
ture of statives, as shown in Figure 4. 


Figure 4: Event structure for statives (Kershner 2002: 62) 


3.2 Nyamwezi aspectual classes 


With regard to the purpose of this study (which is not mainly about lexical 
aspect), Nyamwezi aspectual classes are classified into three: (i) achievements 
(Botne’s (2010) transitional), (ii) statives, and (iii) duratives (Botne’s (2010) ac- 
tions). Statives fall into an independent class different from Botne (2010) who 
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classified them under one of the achievement’s subclasses called resultative. Fol- 
lowing Botne (2010), I further classify duratives into activities and semelfactives. 
Semelfactives are like achievements in allowing punctual readings, but unlike 
canonical achievements they also allow iterative readings (cf. Kearns 2000). For 
this latter reason, semelfactives are classified as one of the subclasses of dura- 
tives. But, as will be further explained later, duration of semelfactives does not 
describe a clearly identifiable process like that of activities. 

All classes are summarized in (8) and discussed in brief afterwards. I will use 
the framework shown in Figure 1 to show different event structures of each class, 
and apply at least one or two tests in each class to unveil the range and character- 
istics of the lexicalized temporal phases. Some of these tests were adopted from 
Kershner (2002), Crane & Fleisch (2016) and Persohn (2017b). 


(8) Nyamwezi aspectual classes 
Aspectual Classes 


| 
I T 
Achievements States Duratives 


Activities Semelfactives 


By definition, ACHIEVEMENTS have a punctual nuclear phase, which constitutes 
a change (or transition) from one state (or location) to another. There are quite 
a number of achievements in Nyamwezi. These include -gaasa ‘ferment slowly’, 
-gina ‘become fat’, -pola ‘become rotten’, -gwa ‘fall’, -pola ‘become cold’, -lola 
‘become sour’, -lrma ‘go up (e.g. a tree, a hill)’, -mana ‘get to know’ and -Boomba 
‘soak’. As schematically represented in Figure 5, achievements encode an onset, 
a punctual point-of-transition and the resultant/coda phase. Achievements may 
be further classified based on whether an onset phase is or is not part of the event 
structure (see Figure 3). But, this sub-classification of achievements seems to be 
of minor relevance to the discussion of the readings of -g-...-ile. 


OIC 


N 


Figure 5: Achievement event structure 


One of the tests for achievements in Nyamwezi is their interpretation with 
-andya ‘begin’. In most of these aspectual classes, -andya implies that the coda 
phase has not yet been reached, but the onset phase has started (9). 
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(9) gaandyága kopola ila gakapólílé 
ga-á-andy-ag-a ko-pol-a ila ga-ká-pol-íle 
cl.6-begin-cPr-start-REC-FV INF-cool_off-Fv but cl.6-NEG-cool off-íle 
kisoga 
kisoga 
good 


‘It (the water) has begun to become cold, but it is not cold yet (it is warm)’ 


In (9), ‘but it is not cold yet’ gives some evidence that the onset phase has 
begun, but the stative coda phase has not yet been reached. 

Another test for achievements is their compatibility or incompatibility with 
the persistive form -táá- ‘still (being)’ plus progressive form -lrr-. The -táá- plus 
-lu- construction, in principle, denotes a situation which is still in progress at 
or around the reference time, thus usually compatible with duratives (where the 
construction targets durative nucleus (see e.g. 14) and statives (where it targets 
the continuing state (see e.g. 13). Since the nuclear phase of achievements is punc- 
tual, the -táá- plus -lrr- construction is usually not accepted (10). However, for 
those achievements which are acceptable with -táá- plus -lrr- (e.g. 11), the test 
selects an extended onset phase, a phase before a punctual change-of-state. So, 
example (11) still emphasizes that the change-of-state nuclear phase of achieve- 
ments is punctual regardless of its co-occurrence with -táá- plus -Iz-. 


(10) * gataälr galupola 
ga-taa-lr ga-In-pol-a 
cl.6-still-Aux cl.6-PRoG-become rotten-Fv 


Lit. ‘They (e.g. mangoes) are still becoming rotten’ 


(1) gataál galufoómbá, liindulaga hadoóo 
ga-táá-lr ga-ln-Boomb-a ` liind-nl-ag-á hadoóo 
cl.6-still-Aux cl.6-PROG-soak-Fv wait-APPL-IMP-FV short while 
gakopoomba ` doó 
ga-ku-Boomb-a dod 
cl.6-FUT-soak-Fv few 


‘They (e.g. cassava) are still soaking; wait a short while, they will be 
soaked’ 


STATIVES differ from achievements (and also duratives) in one aspect; they 
have no phasic structure making up different phases of an event. All they do is 
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describe an indefinite state. Following Kershner (2002), statives are schematically 
represented by a dashed line, as shown in Figure 4. Examples of statives are verbs 
such as -chilwa ‘dislike’, - ikólá ‘resemble’, -saata "be sick’ and -togw ‘love, like’. 

One way to show that statives form a separate class from achievements is 
their interpretation with -andya ‘begin’. We saw that in achievements -andya 
asserts that the onset phase has started but the following coda phase has not 
been reached. This is not true with statives. It is impossible to specify all phases 
of the event in statives, thus -andya refers to the beginning of the stative event 
(not the beginning of O, N or C phase). 


(12) waandyaga kosaata haángi 
o-á-andy-ag-a ^ ko-saat-a again 
3sG-begin-REC-FV INF-be_sick-Fv again 


‘S/he has begun to be sick again’ 


Another way which shows that statives and achievements fall into separate 
classes is their (in)compatibility with the persistive form -táá- ‘still (being) plus 
the progressive form -lu-. We previously saw that this test is usually incompat- 
ible with achievements (e.g. 10) or targets the extended onset phase in those 
achievements it co-occurs with (e.g. 11). Statives with -táá- + -Ir- denote the 
continuation of a state, as exemplified in (13). 


(13) ataälr alm saata 
a-táá-li a-Iir-saat-a 
3sc-still-aux 3sG-PROG-be_sick-FV 
“S/he is still sick’ 


DURATIVES, as illustrated in Figure 6, have an extended nuclear phase. There 
are two subclasses of durative verbs. The first subclass, called ACTIVITIES, in- 
cludes verbs which indicate a clearly identifiable process of an event from mo- 
ment to moment. Examples in this class are verbs such as -lyad ‘eat’, -Irma 'cul- 
tivate’, -zugá ‘cook’, -ditrla ‘pour into’, -poola ‘pound’, -zeenga ‘build’, -ja ‘go’, 
-lila ‘cry’, -peela ‘run’, -seka ‘laugh’ -shooka ‘reverse, move backwards’ and -iza 
‘come’. The second subclass of durative verbs, called SEMELFACTIVES (Seidel's 
(2008) “instantaneous” predicates), in principle, includes verbs whose nuclear 
phase is punctual, if the verb’s event is viewed as a single event. But, based on 
the diagnostic test used in this study, semelfactives indicate an iterative series of 
the event, thus classified as a durative aspectual class with an extended nuclear 
phase. This type of aspectual class includes verbs such as -kolola ‘cough’, -luma 
‘bite’, -boola ‘hit’ and -luusa ‘kick’. 
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(O) | N | (C) 


Figure 6: Durative event structure 


Although semelfactives (when perceived as a series) resemble activities in hav- 
ing an extended nuclear phase, the nuclear phase of semelfactives, as previously 
stated, does not encode a clearly identifiable process like that of activities. The 
main difference between activities and serial semelfactives can be seen in the in- 
terpretation of the persistive form -táá- ‘still (being)’ plus the progressive form 
-lir-. In activities, -táá- plus -Irr- indicates a clear duration of an event. For exam- 
ple, in (14a), the process of lifting and setting down each foot in turn at a speed 
faster than a walk occupies most of the duration of the event. In semelfactives, 
-taa- plus -lrr- does not describe a comparable process. In (14b), -táá- plus -lrr- 
indicates serial or periodic events, i.e. the event of coughing still holds serially 
or periodically. 


(14) a. ataalr alrtpeela 
a-taa-lr a-In-peel-a 
3sG-still-AUX 3SG-PROG-run-FV 
'S/he is still running' 
b. ataáli alrtkolola 
a-táá-li a-In-kolol-a 
3SG-still-AUX 3SG-PROG-tremble-Fv 


‘S/he is still coughing (periodically)’ 


For the purpose of this study, it is important to note that although all activity 
verbs in Nyamwezi encode an extended nuclear phase, motion verbs with direc- 
tional interpretation and a goal (location phrase) such as -ja ‘go (home)’, -peela 
‘run (to the shops)’, -shooka ‘reverse, move backwards’ and -iza ‘comeback to- 
wards a particular point’ behave somewhat differently from other duratives and 
other motion verbs such as -duumoka ‘jump’ and -firtá ‘pass’. I categorize these 
verbs as DIRECTED MOTION VERBS (different from other duratives) because it is a 
category of verbs which can naturally occur with -g-...-ile to encode a reading 
that I have referred to as “progressive-like” in this study (discussed in $4.3). 


4 Interactions -e-...-íle with aspectual classes 


There is a correspondence relationship between lexicalized temporal phases (O, 
N, and C) (discussed above) and tense/aspect forms. Tense/aspect forms act as 
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phase-selectors that “pick out” or “select” matching phases encoded in the lexi- 
cal/phasal dimension (Sasse 2002: 223). In the discussion of the interactions be- 
tween -g-...-ile with aspectual classes in the following subsections, I will argue 
that one of the main functions of -g-...-ile in Nyamwezi, as in Totela (Crane 2013), 
is that of a stativizer, i.e. it asserts a state or property of the subject, resulting 
from the referenced situation/event. In other words, -g-...-ile picks out a phase 
of an event or the entire event (if the event lacks phasic structure) and presents 
it as a stable, undifferentiated property. -g-...-ile does so in three different ways, 
which give three different readings. First, in achievements, -g-...-ile selects the 
coda phase of the situation. In this case, -g-...-ile denotes a state that exists as a 
result of the situation described by the verb constellation, hence it gives a resul- 
tative reading. Second, in statives, -2-...- íle selects the stative phase which is part 
of a situation's inherent structure. In this case, -2-...-íle asserts a state which is 
equivalent to the permanent state of the situation described by the verb, hence 
it gives a general present time reading. Lastly, in directed motion verbs, -e-...-íle 
selects the nuclear phase of the situation. In this case, -g-...-ile asserts a state of 
the verb's situation which is currently underway, hence it gives a reading which 
is translated with progressive form in English (this reading will be referred to as 
progressive-like). The three readings of -g-...-ile in Nyamwezi: resultative, gen- 
eral present time, and progressive-like, are exemplified in Table 2. 

Notice that other activity verbs, apart from directed motion verbs, are usually 
unacceptable with -g-...-ile in Nyamwezi, unless the verb suggests a change-of- 
state or condition to the verb's subject, which has the semantic role of the patient 
(patientive subject), or it is used in a special pragmatic condition where -z-...-íle 
is coerced. This special pragmatic condition gives a second main function of - 
g-...-ile which will be discussed in $4.4. The interaction of semelfactives with 
-g-...-ile, for reasons that will be provided in 84.4, is generally not acceptable in 
Nyamwezi. 


4.1 -g-...-ile with achievements 


Achievements predominantly have a punctual nuclear phase which indicates a 
transition from one state to another. This is an aspectual class which typically 
occurs with the construction -2-...-íle to encode a resultative reading, as exem- 
plified in (15). As previously noted, this reading indicates that the result of a past 
situation holds at the moment of speech (cf. Comrie 1976; Bybee et al. 1994). 


The noun prefix ka- expresses a diminutive. 
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Table 2: Readings of -e-...-íle by aspectual classes 


Aspectual classes ` Temporal interpretations Examples 
Achievements Resultative aginilé 
a-9-gin-ile 


3sG-become fat-ile 
‘S/he is fat (because s/he 
got fat in the past)’ 


Statives General present time kasaatilé 
ka!-g-saat-ile 
cl.12-be_sick-ile 
“S/he (small child) is sick’ 


Activities Progressive-like azíílé koPügu 
(Directed motion a-9-z-ile ko-Pugu 
verbs) 3sG-go-ile Loc-Pugu ‘S/he 


is going to Pugu' 


(15) a. goofupnág iiss lyáa-kopií nhana gálolilé 
ga-ko-fun-ag-a issu lyaá-kopií nhana ga-lol-ile 
cl.6-HAB-give_out-HAB-FV smell cow-to be truth cl.6-sour-ile 
‘It gives out a smell indicating that it really is sour’ 

b. kipeélile gáshy' worhgati choopi kiólilé 
kr-o-feél-íle ` gashí ^ o-mgatn ch-ofiií kr-e-pol-íle 
cl.7-be nice-íle whereas A-inside cl.7-Aux cl.7-become rotten-íle 
noolo kifíípilé 
noolo kr-e-piíp-íle 
Or cl.7-become bad-íle 


‘It appears to be nice, but inside it is rotten or bad? 


c. nwaana ronooyo aginilé yé 
1w-aaná ronooyo a-e-gin-ile yé 
cl.1-child DEM 3sc-become fat-íle very 


"The child is very fat. 


The resultative reading indicates that a causal relationship exists between the 
past/prior eventuality and the current state. For instance, in (16) the situation of 
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being sour resulted from the situation of becoming sour. This relationship, as ex- 
emplified and illustrated in (17), can be clearly shown using the progressive form 
-lrr-, which indicates the prior situation, and the construction -g-...-ile, which in- 
dicates the current state. In (17), a horizontal dotted line indicates the flow of an 


event from onset phase to coda phase; the shadow marks the phase encoded by 
-lrr- and -e-...-íle. 


(16) a galulola 
ga-In-lol-a 
cl.6-PRoG-become sour-Fv 
'It is becoming sour: 
b. galolilé 
ga-o-lol-ile 
cl.6-6-become sour-íle 


‘It is sour. 


(17) Event-structure construals of the progressive form -lu- and the construc- 
tion -g-...-ile 


a. O C 
TP eps TN TI D ESS Ses és > 
becoming sour being sour 
arie e e AU pi; m SAI E, ET E > 
becoming sour being sour 


All examples given in this section express a resultative reading, a reading 
which focuses on the present/ongoing state resulting from an event. This reading 
should not be confused with the perfect (of result) reading, a reading which fo- 
cuses on the past event which resulted in a state of affairs, which in English can 
be exemplified by I have built a house next to the river. Evidence that -g-...-ile in 
Nyamwezi denotes a resultative reading (in achievements), and not a perfect of 
result, is its compatibility with -táá- ‘still (being), which indicates that the result 
state of the situation still holds (is extended) (18). 


(18) a. ataáli agmilé 
a-taa-li a-9-gin-ile 
3sc-still-Aux 3sc-become fat-íle 
‘s/he is still fat 
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b. ataälr amánílé kweendéésha asikélí 
a-taa-li a-o-man-íle ko-endéésh-a pasikélí 
3sc-still-Aux 3sc-get to know-íle INF-cycle-Fv bicycle 


‘S/he still knows how to ride a bicycle’ 


Co-occurrence of - táá- ‘still (being)’ plus -9-...-ile in Nyamwezi is a good piece 
of evidence for the resultative reading because resultatives, unlike perfects, in 
principle, are incompatible with ‘still’ (cf. Dahl 1985:134). For example, in English 
*I have still built a house next to the river is not acceptable. 


4.2 -g-...-ile with statives 


In Figure 3, it is stated that statives denote an indefinite state. These aspectual 
classes have no internal change; thus I argued that they lack a phasic structure. 
There are very few statives in Bantu languages in general, and in Nyamwezi 
in particular. Non-tense-marked statives in Nyamwezi denote a general present 
time reading; a reading in which the state of the situation having occurred is 
equivalent to the prior situation. For instance, all phases of the event asaatilé 
‘S/he is suffering’ in (19) are identical. Whichever point of time we choose to cut 
in on this event, we shall find exactly the same situation. 


(19) a. asaatilé kwrngila kofyyaálwa kwaákwé 
a-g-saat-ile ` kwrngilako-pyaál-w-a ^ kwaákwé 
3sG-be sick-íle since INF-bear-PASS-FV POSS 


‘S/he has been suffering since her birth: 

b. Batogilwé ^ kofawrlaa  fiíchaafó pátomame 
pa-e-tog-ílwe ku-Ba-wul-a a-fiíchaapó pá-tomám-eé 
3P1-love-ile INF-oc-tell-Fv cl.1-companions their cl.2-work-oPr 


"They like to tell their mates that they should work (hard). 


Statives in this language do not neatly indicate a cause-result relationship be- 
tween the prior eventuality and the current state, thus both the progressive form 
-lu- and the construction -g-...-ile give more or less the same interpretation, as 
shown in (20). 


(20) a. kalusaata 
ka-Irr-saat-a 
cl.12-PRoG-be sick-Fv 


‘S/he (small child) is sick? 
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b. kasaatilé 
ka-g-saat-ile 
cl.12-be_sick-ile 
“S/he (small child) is sick’ 


Although both -lrr- and -e-...-íle receive the same translation in English, there 
is a difference having to do with whether an event is bounded or unbounded. The 
progressive form -lrr- suggests that the event holds at the moment of speaking 
but it will come to an end sometime in the future, whereas the construction -2- 
…-ile does not give this implication. It just describes a (permanent) state. This 
distinction can be equated with Kratzer's (1995) distinction between stage-level 
statives which refer to temporary states and individual-level statives which refer 
to permanent states. Schematically, the difference between -lrr- and -g-...-ile is 
shown in Figures 7 and 8, where a bold vertical line indicates boundedness of an 
event in Figure 7 (stage level). The lack of this line in Figure 8 implies the event 
is unbounded (individual-level). 


ka-lu-saat-a 


Figure 7: -līr- event structure 


ka-9-saat-ilé 


Figure 8: -g-...-ile event structure 


It is hard to find a context where the difference between -lrr- and -e-...-íle oc- 
curs. One of the language consultants for this study mentioned that in many 
contexts he can use both ways to express the same meaning. The difficulty in 
finding a context where one can appreciate the difference between the two for- 
matives lies in the fact that the situation of getting sick for instance, which for 
the case of achievements can be selected by the progressive, is equal to the cur- 
rent state (which in achievements is picked out by the -g-...-ile). Statives do not 
show this type of relationship. 

In order to find the difference, consultants were asked to tell which of the 
two forms they can use when talking about chronic diseases. They all agreed 
that the construction -ø-...-íle is appropriate when talking about chronic diseases 
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(hence unacceptable in 20b), while the form -/rr-is only used to talk about curable 
diseases like malaria (hence acceptable in 20a). The use of -g-...-ile when referring 
to chronic diseases gives some evidence that this construction really describes a 
state, while -lrr- denotes an event which is true for the time being, but probably 
it will not persist in the coming days. 


(21) [Context: Someone has arrived and is looking at the children who are 
playing at the house compound. As she was expecting to see J, but J is 
not there, she then asks “where is J today?”] 


* 


a. * waaláálaga mkaaya, asaatilé málelíyá 
o-á-laál-ag-a m-kaaya, a-g-saat-ile ^ malelíyá 
3sG-CPL-sleep-REC-FV LOC-homestead 3sG-be_sick-ile malaria 


'She is sleeping inside the house, suffering from malaria? 


b. waaláálaga mkaaya, alitsaata 
o-á-laál-ag-a mu-kaaya, a-In-saat-a 
3sG-cPL-sleep-REC-FV LOC-homestead 3sc-PROG-be sick-rv 
malelíyá 
malelíyá 
malaria 


'She is sleeping inside the house, suffering from malaria? 


4.3 -g-...-ile with activities, classified as “directed motion verbs” 


As described in the previous section, duratives, in general, have an extended 
nuclear phase (as shown in Figure 9). Duratives are classified into two subclasses: 
activities (those in which the nuclear phase indicates internal change of an event 
from moment to moment, e.g. -lyaá ‘eat’ and -peela run and serial semelfactives 
(those in which the nuclear phase constitutes multiple occurrences of a situation 
when perceived as series, e.g. -kolola laugh"). 


(O) N (C) 


Figure 9: Durative event 


In the discussion of activity verbs in 83.2, I stated that motion verbs with di- 
rectional interpretation and a goal (called directed motion verbs), e.g. -peela 'run 
(to the store)’, behave somewhat differently from other duratives. These verbs 
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in Botne & Kershner’s model do not specify the coda phase, a phase at which 
a result state comes about. So, they are schematically represented in Figure 10 
where the last phase is nuclear phase. 


(O) N 


Figure 10: Event structure for directed motion verbs 


We saw in example (16) that in aspectual classes which encode a resultant/coda 
phase, -g-...-ile picks out this phase and gives a resultative reading. Since in 
directed motion verbs the coda phase is not part of the situation's event (see 
Figure 10), -g-...-ile picks out the nuclear phase and gives a reading which can 
be translated with progressive form in English (progressive-like reading) (22). 
Ebert (1995: 194) notes a similar phenomenon in Arabic where what is called 
the "Active Particle" form, which usually indicates perfect/resultative meaning 
in other verbs, in verbs of posture, of motion and of sensory perception expresses 
a progressive-like reading. 

It is important to state that -e-...-íle in both achievements and directed motion 
verbs is attuned to the initial (left) edge of the nuclear phase. This means that in 
achievements the perspective is in the stative coda phase, but in directed motion 
verbs it is in the nuclear phase. 


(22) a. apeelilé ^ kojaa komadooka 
a-g-peel-íle ko-ja | Kko-madooka 
3sG-run-ile INF-go-FV LOC-shops 
‘S/he is/has run off to the shops. 
b. gaängr  giizilé nágaángi gashookilé 
ga-ángi ga-9-iz-ile na-gaangi ga-shook-ile 
cl.6-other cl.6-come-ile and-other cl.6-return-ile 


€ . . . " ` , 
Some (e.g. lions in a narration) are coming and some are going back. 


The progressive-like reading in directed motion verbs, as exemplified above, is 
also a resultative in its function, just like that of achievements. The progressive 
reading in these verbs is a matter of the English translation (cf. Ebert 1995:189). 
The difference between a resultative reading of achievements and that of directed 
motion verbs is that, in achievements the result state is triggered by a nuclear 
phase which denotes the change-of-state, whereas in directed motion verbs the 
initial point (onset phase) is regarded as a point of change (cf. Smith 1991:70). 
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Since the phase following this point of transition in directed motion verbs is a 
nuclear phase, -g-...-ile selects this ongoing/dynamic non-stative phase to encode 
a stative(-like) meaning along the lines of ‘be directed towards a certain goal’. 

One piece of evidence is present to justify the claim that directed motion 
verbs encode a kind of change-of-state which is similar to that of achievements. 
Both directed motion verbs, as shown in (23), and achievements (see e.g. 18 re- 
peated in (24)) occur with the persistive form -táá- ‘still (being) plus -g-...-ile. 
Co-occurrence of this construction with directed motion verbs indicates that a 
state that results from an ongoing situation still holds. 


(23) ataálı alı mnzia azíílé kaayá 
a-taa-lr a-lı mu-nzila a-j-ile kaaya 
3sc-still-Aux 3sc-COP roc-path 3sc-go-ile home 


'S/he is on the way (going) home' 


(24) ataälr aginilé 
a-taa-li a-9-gin-ile 
3sc-still-aux 3sc-become fat-íle 
“S/he is still fat’ 


In directed motion verbs, English makes no distinction between the progres- 
sive form -lrr- and -g-...-ile. So, both a-lrr-peel-a and a-peel-ilé are translated in 
English as ‘s/he is running’. The difference between -l- and -e-...-íle, however, 
can be settled by context. For instance, in (25), the context is such that the speaker 
is watching a movie but he missed a part in the movie where the actress (Kajala) 
saw a lion and started to run away. He then asks another person who started to 
watch the movie before he came "Why is Kajala running away?" (koongono-kr 
o-kajala a-lu-peel-a?). A response to this question with the progressive form -lrr- 
sounds fine (25a), but one with -g-...-ile sounds odd (25b). 


(25) a. alnpeel iishiímbá 
a-In-peel-a i-shiimba 
3sc-PROG-run-rv cl.5-lion 
‘She is fleeing from a lion’ (Lit: ‘She is running away from the lion’) 
b. * apeelil’ tishiimba 
a-9-peel-ile i-shiimba 
3sc-run-ile cl.5-lion 


‘She fled from a lion’ (Lit: ‘She is running away from the lion’) 
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Likewise, the progressive form -lu- cannot be used to refer to the continuing 
state of the subject, which is encoded by -g-...-ile. For instance, (26) is in a context 
where the teacher is looking for a missing student (Anna). He then asks other 
students who were outside the class playing, "Have you seen Anna today?" (w- 
aa-mu-Bon-ag-a o-ana leeloó?). A response to this question with the progressive 
form -Irr- sounds odd (26b), but one with -g-...-ile sounds fine (26a). 


(26) a. waafrtaga apeelilé 
o-á-pit-ag-á a-g-peel-ile 
3SG-CPL-pass-REC-FV 3SG-PROG-run-ile 
‘She passed (here) running’ 

b. * waafrtaga alrrpeela 
ü-à-fit-ag-à a-In-peel-a 
3SG-CPL-pass-REC-FV 38G-PROG-run-FV 


She passed (here) running’ 


The co-occurrence of the -lrr- and -e-...-íle in the same construction provides 
further evidence that the two forms do not perform the same function. One (-g- 
...-ile) indicates a state and the other (-/rr-) indicates the action in progress. This 
distinction is illustrated in (27) where the point of speech is now, the point of 
reference is in the past, established by -firtá ‘pass’, continued by -peela ‘run’ and 
directed towards a location by -ja ‘go’. The continuation of the event of running 
can be construed as a dynamic state (27a) or a process (27b). Similarly, -ja ‘go’ 
which indicates the direction ofthe event can be construed as a process (27a) ora 
dynamic state (27b). This relationship is further expressed by literal translations. 


(27 a. waafitaga apeelilé, ^ alnja kaaya 
o-á-pit-ag-á a-9-peel-ile, a-1n-j-a kaaya 
3SG-CPL-pass-REC-FV 3sG-run-ile 3sG-PROG-go-FV homestead 
Lit. ‘She passed (here), she was running going home’ 

‘She passed (here), having run, going home: 

b. waafritaga almpeela, aziilé kaáya 
0-á-1t-ag-á a-In-peel-a a-z-ile kaaya 
3SG-CPL-pass-REC-FV 3SG-PROG-run-FV 3sG-go-ile homestead 
Lit. ‘She passed (here) running, s/he was going home’ 

‘She passed (here) running, gone off home’ 


Another piece of evidence is that neither the progressive form -lrr-, as in (28a), 
nor the construction -g-...-ile, as in (28b), can be used with both -peela ‘run’ and 
-ja ‘go’, as each form performs a different function in Nyamwezi. 
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(28) a. * waafrtaga alipeela, alrrja kaaya 
o-á-it-ag-á a-In-peel-a a-Ir-j-a kaaya 
3SG-CPL-pass-REC-FV 3SG-PROG-run-FV 3sG-PROG-go-FV homestead 


Lit. ‘She passed (here) she was running, s/he was going home’ 
‘She passed here, she was running towards home’ 

b. * waaBrtaga apeelilé ^ azíílé kaáya 
o-á-pit-ag-á a-g-peel-ile a-z-íle kaaya 
3SG-CPL-pass-REC-FV 3sc-run-ile 3sc-go-íle homestead 
Lit. 'She passed (here) running, going home' 

‘She passed here, she was running towards home: 


4.4 -g-...-ile with other activities and semelfactives 


In the previous section, I argued that directed motion verbs in Nyamwezi behave 
somewhat differently from other activity verbs and (serial) semelfactives. Di- 
rected motion verbs, as we have seen, do not lexically encode the (resultant state) 
coda phase, they only encode an optional onset phase and a nuclear phase. In 
directed motion verbs, -e-...-íle selects the nuclear phase and gives a progressive- 
like reading. 

Another category of activity verb includes verbs such as -sha ‘grind (e.g. corn)’, 
-ditila pour (e.g. water) into (a receptacle)’, -poola ‘pound (e.g. corn)’, -zeenga 
‘build (e.g. a house)’ -Irma ‘cultivate (e.g. a farm)’ and -chifä ‘block hole/way, 
plug’, which, compared to directed motion verbs, lexically indicates a durative 
nucleus and an entailed resultant/coda state (cf. Persohn's (2017b) processes with 
a resultant state). In many of these verbs in Bantu languages, - íle tends to pick out 
a resultant/coda state phase to give a perfect of result reading (see e.g. Luwanga 
and Lusaamia in Botne (2010)) or resultative reading (see e.g. Totela in Crane 
(2013)). The same is also observed in Nyamwezi, verbs such as -sha ‘grind’ and 
-ditila ‘pour into’, as exemplified in (29a) and (29b), respectively, occur with -e- 
..."Íle to encode a patientive resultative reading. In this reading, the verb's sub- 
ject, which has the semantic role of the patient (patientive subject), undergoes 
a change-of-state. Note that some verbs in this category permit the causative 
alternation in which the patientive resultative reading is triggered by both -ø- 
...-ile and the passive form -w- (29c), though some take the passive form but do 
not allow causative alternation (29d-e)*. For the purpose of this study, I will not 
go into detail to discuss the question why some verbs in this category permit 


"Por comparison, an example below indicates a sentence with agent-verb-patient S-V-O struc- 
ture in Nyamwezi. 
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causative 


alternation while others do not. Further studies are required on the 


lexical semantics of these verbs. 


(29) a. 


“ashiilé vs. zishiilé 

a-g-sh-ile zi-o-sh-ile 

3sG-grind-ile  cl10-grind-ile 

‘S/he has ground (the peanuts)’ “They (peanuts) are ground: 


*adıtılílé vs. yrditililé 

a-9-ditil-ile yr-e-dirtil-íle 

3sG-pour into-íle  cl9-pour 

‘S/he has poured (water) into a vessel! ‘It (a vessel) has been poured 
into’. 

achifilé vs. yrchipílwé (*yrchißílé) 
a-ø-chiß-íle y1-9-chiB-il-w-e 

3sc-block-ile  cl.9-block-pass-ile 

‘S/he has closed (a way)’ ‘It (the way) is closed’ 
*apoolílé vs. gapoolílwé (gapoolílé) 
a-o-pool-ile ga-o-pool-íl-w-e 

3sc-pound-ile — cl.6-pound-Pass-ile 

‘S/he has pounded (the corn)’ ‘It (corn) is pounded’ 


*alrmilé vs. lilmilwé 
a-o-lrm-íle li-g-lim-il-w-e 
3sG-cultivate-ile 3sG-cultivate-Pass-ile 


‘She has cultivated (a farm.) ‘It (a farm) is cultivated’ 


Verbs shown above are subject to a second type of result state analysis which 
is similar to that of achievements. The resultant state in these verbs involves 


a “durative transition” from one state to another, whereas in achievements the 


change is 


thought of as occurring instantaneously (cf. Pustejovsky 1991). 


So far, we have discussed two categories of activity verbs, i.e. those which do 
not lexically encode a resultant/coda phase (directed motion verbs), and those 


(i) 
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activity verbs which typically entail a resultant/coda state and suggest a change- 
of-state of the verb's patientive subject. Another category of verbs includes verbs 
such as -lila ‘cry’ and -seka ‘laugh’, which resemble directed motion verbs in 
encoding an optional onset phase and a nuclear phase without a resultant/coda 
phase. -g-...-ile in general does not occur with these verbs without additional 
context. The context where the interaction of -g-...-ile with verbs such as -seka 
‘laugh’ is accepted is when the speaker needs to show her/his ability towards 
what has been conceived impossible by other people. For example, in (30), the 
speaker may use -g-...-ile to mean that s/he has the ability to make the person 
referred to laugh (something which no one else did). 


(30) a. okowaaneé asekílé 
o-ko-waaneé a-g-sek-ile 
A-Loc-Poss 3sG-laugh-ile 


"To me, s/he must laugh (just wait and see)’ 


In (29), we saw that -sha ‘grind’, -ditila ‘pour into’, and -poola ‘pound’ do not 
naturally occur with -g-...-ile if the verb’s subject has the semantic role of agent 
(agentive subject). However, the speaker can use -g-...-ile if s/he wants to express 
a contradiction and/or emphasis in the similar way as verbs without an entailed 
resultant/coda phase. This is contextualized in (31) using the verb -/rma ‘culti- 
vate’. 


(31) [Context: It is a rainy season, Benny is uncertain whether his brother 
Frank has cultivated. While talking with his neighbors, he says “I wonder 
whether Frank has cultivated this season or whether he even has a farm” 
If one of the neighbors is sure that Frank has cultivated s/he can respond 
using -2-...-íle.] 
alrmilé 
a-ø-lım-íle 
3sc-cultivate-ile 


‘He has cultivated (and I saw his farm.)’ 


In all verbs where -g-...-ile is not acceptable without context, I argue that 
-g-...-ile is coerced to occur with these verbs in order to express a contrast/ 
contradiction and/or emphasis. Collins (1962), as cited in Crane (2012: 70), re- 
ports a similar phenomenon in Tonga, where -ide (another variant of - ile) is used 
to express contradiction (32). See also Woidich (1975), as cited in Ebert (1995: 194), 
for a marker of resultative expressing emphasis. 
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(32) (Collins 1962, as cited in Crane 2012:70) 
micelo ili bolede (- bola ‘become rotten’) 


‘the fruits are rotten, or the fruits are not perfectly sound’ 


According to Collins (1962), the sentence in (32) would be used in argument 
or to express disappointment or surprise. 

The use of -g-...-ile in Nyamwezi to express contradiction/emphasis is differ- 
ent from that of a stativizer which gives readings such as resultative/stative and 
progressive-like. Contradiction and/or emphasis can be regarded as a second 
main function of -g-...-ile which is enforced by context. 

Let's now turn to the discussion of the interaction of semelfactives with -g-...- 
ile. In all contexts attested during elicitation sessions, -g-...-ile was not accepted 
with serial semelfactives. Since (serial) semelfactives resemble activities in encod- 
ing an extended nuclear phase, one of the ways to test co-occurrence of -g-...-ile 
with semelfactives was to see if the interaction would lead to the progressive- 
like reading. To try this, the consultants were asked to say whether they would 
use a-9-kolol-ilé (from -kolola ‘cough’) to mean that someone is coughing. The 
example meant to check if the sentence may get an iterative interpretation (sev- 
eral instances of an event within a measurable duration) (33a) or denote periodic 
events (33b). Both interpretations were rejected. 


(33) a. *ataálr akololílé 
a-táá-I a-9-kolol-ile 
3sc-still-aux 3sc-cough-íle 
‘S/he is still coughing (continuously) 
b. * akololilé kofima mázoolí 
a-9-kolol-ile kofuma mazoolí 
3sG-cough-íle since day before yesterday 


'S/he has been coughing since the day before yesterday' 


Semelfactives cannot occur with -g-...-ile, even though they encode a kind of 
extended nuclear phase which can be picked out by this construction. One of 
the reasons for the co-occurrence restriction is that for a verb which does not 
naturally occur with -g-...-ile it needs a context where this formative can be co- 
erced to occur with that verb. Coercion of -g-...-ile normally suggests a contradic- 
tion/emphasis. For this reason, it is very awkward for a speaker to use -g-...-ile 
with semelfactives to imply that s/he has the ability to enforce the occurrence of, 
let us say, 'cough' which is difficult to do to other people, in the same way it is 
possible with -seka ‘laugh’ in (30). So, the co-occurrence restriction is due to the 
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inability of the speaker to enforce events denoted by semelfactives. Otherwise, if 
the speaker believes that s/he has the ability to enforce a semelfactive event (as 
in a non-literal context), then s/he may use -z-...-íle. 

To conclude this section, -2-...-íle, in general, does not commonly occur with 
activity verbs. The co-occurrence is subject to two conditions (i) the verb denotes 
a result state that leads to the change-of-state/condition of the verb's patientive 
subject, and (ii) the verb is used in a special pragmatic condition where -g-...-ile 
is coerced to express contradiction and/or emphasis. 


5 Conclusion 


To summarize, the paper has generally shown that the analysis of aspectual 
classes is crucial in understanding the functions of -e-...-ílein Nyamwezi. A resul- 
tative reading occurs with achievements, a general present time reading occurs 
with statives, and a progressive-like reading occurs with a handful of activity 
verbs called directed motion verbs. It was also shown that there are some activ- 
ity verbs, apart from directed motion verbs, which occur quite naturally with 
-g-...-ile. These verbs share some commonalities with achievements, as they both 
lead to the change-of-state of the verb’s patientive subject. More importantly, 
it was noted that, activity verbs which lack an entailed resultant/coda phase, in 
special pragmatic contexts, can be coerced to occur with -g-...-ile to express con- 
trast/contradiction and/or emphasis. It was also argued that although coercion 
seems to be an option in contexts where -g-...-ile is not allowed with some verbs, 
this is not the case with semelfactives. Coercion of -g-...-ile with semelfactive 
verbs in general is not allowed due to the inability of the speaker to enforce 
events denoted by semelfactives. 

In short, -g-...-ile in Nyamwezi has two main functions: (i) statitivizer, which 
leads to three readings - resultative, general present time and progressive-like 
- and (ii) contradiction/emphasis. These results suggest that in other Bantu lan- 
guages a more detailed analysis of -íle can yield valuable results, which may 
previously have gone unnoticed. 
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Abbreviations 

A Augment NEG Negative 

APPL Applicative oc Object concord 

ASS Associative OPT Optative 

AUX Auxiliary PASS Passive 

CL noun class PL Plural 

cPL  Completive marker poss Possessive 

CON  Connexive PROG Progressive 

coP  Copular REC Recent past 

EXT Extension RS Resultant state 

FV Final vowel SC Subject concord 

HAB Habitual SG singular 

IMP Imperative TAM Tense, Aspect and Mood 
INF Infinitive Tc/sc Tense copy of subject concord 
IT Itive marker VB Verbal base 


LOC Locative 
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Chapter 16 


Logophoricity in Ibibio 


Lydia Newkirk 
Rutgers University 


This paper presents a description and analysis of logophoric pronouns in Ibibio. 
I show that Ibibio logophors, although they behave in most respects like typical 
logophoric pronouns in West African languages, obey Shift Together like shifted 
indexicals. In order to explain this data I propose that Ibibio logophors are sensitive 
to two operators in the left periphery of the embedded CP: a shifting operator and 
a logophoric binding operator. Ibibio indexicals (which do not shift) differ in that 
they are defined to be insensitive to shifting operators. Thus indexical shift requires 
cooperation between the semantics of the indexical and of the shifting operator. 
This proposal in turn expands the predicted typology of possible de se pronouns 
cross-linguistically. 


1 Ibibio logophors 


Ibibio (Cross-River, Nigeria) logophors are distinct from both ordinary pronouns 
and reflexives (1), and can only occur in embedded clauses (2).! 


(1) a Ekpe;a-bo ke(img);i-ma i-to Udo logophor 

Ekpe 3sc-say C LOG — Loc-PsT Loc-hit Udo 
‘Ekpe; says that he; hit Udo’ 

b. Ekpe;a-bo  keanyej;,a-diyono ikwo ikwo mfonmfon pronoun 
Ekpe 3sG-say C 3sG 3sG-know sing song well 
‘Ekpe; says that bet sings well’ 

c. Ekpeja-bo  kei-ma  i-to idem; reflexive 
Ekpe 3sG-say C Loc-PsT Loc-hit self 
‘Ekpe; says that he; hit himself; 


‘With the exception of the logophoric markers, on which I annotated the tone for clarity, I use 
the Ibibio orthography developed in Essien (1990), which does not mark tone. 
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(2 a. *Ekpe;a-ma  a-diya adesi imo; logophor 
Ekpe 3sG-PST 3SG-eat rice LOG.POSS 
Intended: ‘Ekpe; ate his; rice’ 
b. Ekpe;a-ma  a-diya adesi amọ;/ķ pronoun 
Ekpe 3sG-PST 3SG-eat rice 3sG.POSS 


‘Ekpe; ate his; rice’ 


Logophoric pronouns differ in both agreement and morphology from the other 
pronominals in Ibibio (both the pronominals and logophors are listed in Table 1). 
However, like all of the other pronouns in the language, their independent forms 
are optional except in cases of emphasis or disambiguation. In (2a), the logophor 
cannot occur as a possessive in an unembedded clause, even though in the same 
context a regular third person pronoun is acceptable as corefering to the local 
subject. 


Table 1: Ibibio pronouns and agreement markers 


Pronouns Agreement 


Subject Object Subject Object 


1SG ami mien n- n- 
2SG afo fien a- u- 
3SG anye anye a- a- 
1PL nnyin i- i- 
2PL ndufo e- e- 
3PL ommo e- e- 
LOG.SG imo i- i- 
LOG.PL mmimo i- i- 


Note from (1b) that Ibibio patterns like Yoruba (Adesola 2005: (3)), which has 
a strong pronoun oun that must refer to the attitude holder, and a weak pronoun 
o that is ambiguous between referring to the matrix attitude holder and taking 
some other third person referent. By that same token, Ibibio’s pattern is distinct 
from Abe’s (Koopman & Sportiche 1989: (4)), in that regular Ibibio pronouns do 
not show anti-logophoricity; a regular pronoun can still corefer with the matrix 
subject.” 


"In these and other examples from the literature, I preserve the glossing and capitalization from 
the cited work. 
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(3) (Adesola 2005: 34) 
Olu; ti kéde NOr;n.roc]pé ounj;/ojyyn  bsù lsóla 
Olu AsP announce that he PROG come tomorrow 
‘Olu; has announced that he;/he;; is coming tomorrow: 

(4) (Koopman & Sportiche 1989: 64a) 
yapi; RE kO Oj/nj(,) ye sE 
Yapi said kO he is handsome 
‘Yapi; said that ber is handsome’ 

Assuming the facts from Abe are robust, this points to one potential variance in 


how logophors and regular pronouns tend to get their antecedents. 
There are also separate plural logophoric pronouns in Ibibio: 


(5) ommo;e-ke cho kemmimo;,,i-ma  i-kot nwet 
3PL 3PL-PST 3PL-say C LOG.PL LOG-PST LOG-read book 
‘They; said that they;/,, read a book 
Split antecedence is possible with a plural logophor (see also Yoruba; Adesola 
2005), so long as the closest potential logophoric antecedent is included in the 
group taken to be the antecedent for the logophor. 
(6) Ekpe;a-bo | kemmimo;;,;í-diya afit adesi ado 
Ekpe 3sG-say C LoG.PL LoG-eatall rice DEM 
‘Ekpe; says that they, ;; } ate all of the rice’ 
(7 Ekpe;a-bo ke Udo;a-kere keete | mmimo/ y; j,k} a-ya 
Ekpe 3sG-say C Udo 3sc-think C father roc.Pr 3SG-FUT 
i-di i-wo 
LOG-come LOG-visit 
‘Ekpe; says that Udo; thinks that their, ; ; y, j,k } father will come visit: 
The inverse relation, where a singular logophor has a plural antecedent, is im- 
possible: 
(8) *[Akpanye  Udo];e-ma  e-bo  keimo;i-ma  í-diya sokoro 
Akpan conj Udo 3PL-PST 3PL-say C LOG LOG-PST LOG-eat orange 
Intended: ‘[Akpan and Udo]; said that he; ate the orange. 


This all is analogous to the facts of Yoruba as reported by Adesola (2005). 
Logophors in Ibibio are subject-oriented, established in (9a-10): 
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(9) a. Ekpe;a-ma  a-kop  keUdo,a-ma  í-kit 

Ekpe 3sG-PsT 3sG-hear C Udo 3SG-PST LoG-see 
‘Ekpe; heard that Udo, saw him,’ 

b. Ekpe;a-ma  a-kop  a-to Akpan; ke Udo, a-ma ` íj,;-kit 
Ekpe 3sG-PsT 3sG-hear 3sG-from Akpan C Udo 3sG-PST LOG-see 
‘Ekpe; heard from Akpan; that Udo; saw him; ; 

(10) Akpan;a-ma  a-dokko Ekpe% ke a-kpe a-na nte (imoj/,4) 

Akpan 3asc-PsT 3sG-tell Ekpe C 3SG-cOND 3s6-MopD C LOG 

i-dep  adesi mfin 

LOG-buy rice today 

‘Akpan; told Ekpe; that he;;,;. should buy rice today: 


Rather than being tuned to the source in (9b), the embedded logophor can only 
refer to the syntactic subject, which 'hear' licenses as a logophoric antecedent 
independently (9a). Similarly, in (10), a logophor cannot refer to the addressee 
introduced in the matrix clause, but must refer to the subject. 

Like what has been reported for Yoruba (Adesola 2005; Anand 2006), but un- 
like what has been reported for Ewe (Pearson 2015), Ibibio logophors are obliga- 
torily interpreted de se: 


Context: Ekpe sings on occasion, but will never admit that he is any good. So one 
time, during one of his performances, you record him without his knowledge. 
Some time later, you play back the recording to him without telling him who 
is singing. Ekpe doesn’t recognize himself in the recording, and comments “he 
sings well.” 


(11) a. Ekpe;a-bo ke anert a-diyono ikwo ikwo mfonmfon 
Ekpe 3sG-say C 3sG 3sG-know sing song well 
‘Ekpe; said that hej, sings well? 
b. #Ekpe;a-bo ke img; i-me i-diyono ikwo ikwo mfonmfon 
Ekpe 3sG-say C LOG LoG-PRES LOG-know sing song well 


Intended: ‘Ekpe; says that he; sings well? 


? De se in this case refers to knowing self ascription of a property. For example, the de se reading 
for “John believes that he wrote the best paper" is the reading where John has the belief “I 
(John) wrote the best paper.” This differs from a de re reading, where John may have read his 
paper without remembering that he wrote it, although he still comes to the conclusion that 
the paper that he wrote (but does not remember) is the best paper. Because in this case John 
does not identify himself as the writer of the best paper, this latter interpretation is de re. 
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In the above context, Ekpe does not knowingly attribute singing well to himself, 
but instead does so accidentally. That is, he only ascribes singing well to himself 
de re, but not de se. In such a context, only a regular Ibibio third person pronoun 
can be used, and the logophor is illicit. 

When multiply embedded, Ibibio logophors can take antecedents more than 
one clause away: 


(12) Ekpe;a-bo | keUdo,a-ke a-kere ke (imo;/r) i-ke i-kit 
Ekpe 3sG-say C Udo 3sc-pst 3sc-think C LOG LOG-PST LOG-see 
Ima 
Ima 


‘Ekpe; says that Udo, thinks that bet saw Ima’ 


But when there is more than one logophor in the same clause, the coreference 
options are more limited: 


(13) Ekpe;a-ma  a-kop  keUdo,a-ke a-bo ke ayin-eka imo},.; 
Ekpe 3sG-PST 3sc-hear C Udo 3sG-PST 3sG-say C brother LOG.POSS 
a-ma a-kit  imoy,.; ke udua 
3SG-PST 3SG-see LOG at market 
‘Ekpe; heard that Udo, said that bist: brother saw him,,,; at the 
market. 

(14) *Ekpe;a-ma  a-kop  keUdo,a-ke | a-bo ke imoj, i-ma 
Ekpe 3sG-PST 3sc-hear C Udo 3SG-PST 3sG-say C LOG  LOG-PST 
i-to  imo;x 
LOG-hit LOG 
Intended: ‘Ekpe; heard that Udo, said that hej, hit bim at 


While (13) is grammatical, the only available interpretation is the one where both 
logophors take the same antecedent.* More striking is that in (14) this effect is 


“An anonymous reviewer asked whether multiple multiply-embedded logophors must also take 
the closest antecedent. Unfortunately I do not have data to confirm whether this is in fact the 
case. However, if Ibibio logophors indeed behave like shifted indexicals (as I will claim), then 
it should be possible for the two logophors to take a more distant antecedent, as in Zazaki 
(Anand & Nevins 2004): 


(i) (Andrew): Ali, miy-rava ke Heseniy toy-ra va Soa) brave Rojda-o 
Ali me-to said that Hesen  you-to said I brother Rojda-GEN 


‘Ali said to Andrew that Hesen said to Andrew that { Hesen, Ali, «Andrew } is Rojda's 
brother? 
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not ameliorated even to avoid a Condition B violation.? Instead, the sentence is 
simply ungrammatical. Similarly the context in (15) fails to license two embedded 
logophors: 


Context: Udo and Akpan are two young schoolchildren, and are brothers. Ekpe 
is their friend, and is the same age as they are. One day, Udo and Akpan’s father 
comes home and says that he saw Ekpe at the market when Ekpe was supposed 
to be in school. Word that he has been spotted skipping class gets back to Ekpe. 


(15) #Ekpe;a-ma  a-kop ke [Akpan ye Udo], e-ke  e-bo ke 
Ekpe 3sG-pst 35G-hear C Udo conj Akpan 3PL-PST 3PL-say C 
ete mmimo, a-ma  í-kit | imojke  udua 
father LOG.PL.POSS 3SG-PST LOG-see LOG PREP market 
Intended: ‘Ekpe; heard that [Akpan and Udo]; said that their; father 
saw him; at the market? 


The context in (15) does not support partial/split antecedence for the plural lo- 
gophor, and so the plural embedded logophor has to take a different antecedent 
than the singular logophor. But (15) is completely infelicitous in this context. In 
sum: Clausemate logophors (in Ibibio at least) have to refer together. 

This particular restriction on logophors is (to my knowledge) unattested in the 
de se literature. Abe, Yoruba, and Ewe are all reported to allow multiple embedded 
logophors to take separate antecedents: 


(16) Abe (Koopman & Sportiche 1989: 41, 44a) 


a. n;ceewun kolo pt 
n friend Det likes n 


‘his; friend likes him), g 


This example is unfortunately complex, because there is potentially multiple shifting acts hap- 
pening. But most importantly, although Hesen can be the antecedent for the deeply embedded 
first person pronoun, that is not the only reading of the sentence. Ali, the more distant attitude 
holder, is also eligible to antecede the shifted first person pronoun. I have no reason to expect 
Ibibio logophors to behave otherwise. 

The reviewer also asked whether reflexives have any impact on multiple logophors or long- 
distance antecedents. Logophoric reflexives are not themselves long-distance reflexives, but 
only local anaphors, and as such would have to have a logophor as a local antecedent, as in 
(1c) 

5A Condition B violation occurs when a (non-reflexive) pronoun is bound locally. For exam- 
ple, John; likes him,,,; is unacceptable on an interpretation where John and him refer to the 
same person (that is, John likes him cannot mean John likes himself), because the pronoun 
would be bound locally, which violates Condition B of the Binding Theory, which states that 
a pronominal must be free within its clause. Note that if the pronoun is not in the same clause 
as its antecedent, binding is possible (e.g., John; said that he; is a genius), because this does not 
violate Condition B. 
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b. Api; bO wu ye nyk kolo nat 
Api believe yen likes n 
‘Api; believes that he; likes bim: 
(17) Yoruba (Anand 2006: 177) 
Olu; so pé Ade,ro pé baba oun;x ti  rHy4 oun;/; 
Olu say that Ade think that father oun gen PERF see mother oun.gen 
‘Olu; said that Ade; thought that his;/, father had seen his;/; mother? 


(18) Ewe (Clements 1975: 73) 
Kofi; xo-e se be Amar gblobe yéj;,z-fu ye, 
Kofi receive-PRo hear that Ama say that LocG-beat LOG 
‘Kofi; believed that Ama; said that he; beat her;’ or 
‘Kofi; believed that Ama, said that she; beat him,’ 


The Abe data requires a bit of explanation: Koopman & Sportiche (1989) report 
that n-series pronouns (logophors) are licit in matrix clauses, but if two of them 
occur they must have the same antecedent (16a). When embedded, however, they 
are able to receive disjoint interpretations. This puts (16b) in the same general 
pattern with the Yoruba (17) and Ewe (18) examples, and all of them in contrast 
with Ibibio. 

In order to explain this unusual property of Ibibio logophors, I must take a 
brief detour into the shifted indexicals literature, which will shed light on the 
difference between Ibibio and other logophoric languages. 


2 Ibibio logophors as shifted indexicals 


2.1 Indexical shift cross-linguistically 


The leading analysis of shifted indexicals in the literature is that proposed by 
Anand (2006), and essentially followed by Sudo (2012); Shklovsky & Sudo (2014); 
Deal (2017), inter alia. Descriptively speaking, shifted indexicals are cases of per- 
son, locative, or temporal indexicals (such as I, here, or yesterday) which, when 
embedded under an attitude verb or verb of saying, do not refer to the utterance 
context, but instead refer to the context established by the embedding verb (in 
these examples, AuTH(c) denotes the speaker of the entire utterance, and ADDR(c) 
denotes the addressee of that utterance. That is, they are used for the English 
non-quotative senses of I and you). 
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(19) Zazaki (Indo-Iranian, Turkey), (Anand & Nevins 2004: 13) 
vizeri Rojda Bill-ra va ke 3zto-ra miradisa 
yesterday Rojda Bill-to said that I you-to angry.be-PRES 


"Yesterday Rojda said to Bill, “I am angry at you” 
"Yesterday Rojda said to Bill, “AUTH(c) is angry at ADDR(c) 


“Yesterday Rojda said to Bill, “AUTH(c) is angry at you: 
“Yesterday Rojda said to Bill, “I am angry at ADDR(c).”’ 


»» 


One property of shifted indexicals cross-linguistically is that they obey Shift To- 
gether: Two indexicals embedded under the same attitude verb must either both 
shift or neither shift, as demonstrated by the possible interpretations of (19), and 
illustrated schematically below. 


(20) SHIFT TOGETHER Constraint (Anand 2006: 297) 
All shiftable indexicals within an attitude-context domain must pick up ref- 
erence from the same context. 


a. C4 [...modal Cg ...[ ind! ...ind} ]] 
b. CA[...modal Cg ...[ ind? ...indP ] 
c. * C4 [...modal Cg ...[ ind^ ...ind} ] 
d. * C4 [...modal Cg ...[ ind? Zodi) 


Anand (2006) derives Shift Together by defining shifting operators that over- 
ride the context values under attitude verbs. Where context parameters typically 
refer directly to the utterance context, these operators modify the context so that 
indexicals in their scope refer to the context set by the attitude verb, rather than 
the context set by the utterance. 


(21) [OP «]*! = oli. where j = (auTH(i), ADDR(c), TIME(c), WORLD(c)) 
(22) [OP æ Je [a], where j = {AUTH(i), ADDR(i), TIME(c), WORLD(c)) 


Because the operators overwrite the contextual information rather than simply 
adding to it, any indexical dependent on an overwritten value is forced to shift, 
and can never “un-shift”. 

Another analysis present in the literature is that of Schlenker (2003). He pro- 
poses in indexical shift languages, the shiftable indexicals are lexically defined to 
optionally shift under the right sort of attitude verb. Non-shiftable indexicals, on 
the other hand, are defined to always take the utterance context, rather than any 
embedded context variable. In this sense, they act very much like bindees under 
an attitude verb (c* refers to the utterance context). 
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(23) a. English ‘T’: +indexical, +c* 


b. Amharic ‘T: +indexical, [underspecified] 


There is one main objection to Schlenker’s proposal: If shiftable indexicals are 
underspecified for a context variable, two embedded indexicals are expected to 
be able to take different context variables; Shift Together is left unexplained. As 
a result, the theory by Anand (2006) summarized above is the analysis more 
commonly used in the literature. However, Schlenker’s theory will come into 
play for my analysis of Ibibio logophors, which I now turn to. 


2.2 Logophors as shifted indexicals 


A tempting solution to the problem of Ibibio logophors is to propose that they 
are actually first-person indexicals, and that Ibibio has a shifting operator that 
shifts those indexicals like in Zazaki and Amharic. This is unfeasible however, 
because true Ibibio indexicals never shift, even if they occur clausemate with a 


logophor: 


(24) Ekpe;a-kere ke (imo;)i-ma i-n-kit mien 
Ekpe 3sc-think C LOG — LoG-PsT LOG-1sG-see 1SG.OBJ 
‘Ekpe; thinks that he; saw me’ 


If there is a shifting operator present in (24) that overwrites the AUTH value in the 
context (as would be expected for a pronoun that refers to the attitude-holder), 
then the first person indexical should also shift. But the true indexical stays con- 
stant to the utterance context. 

In light of this, I claim that although an operator-based approach is essen- 
tially correct for shifted indexicals, the behavior of Ibibio logophors indicates 
that operators alone are not sufficient to account for indexical shift. I propose to 
integrate Schlenker's insight that the pronominals should be defined as shiftable, 
but with the adjustment that a pronominal's sensitivity to shifting is defined lex- 
ically, and no pronominal is underspecified for what context variable it takes. 
Either a pronominal will always shift in the presence of an operator, or it never 
will. For Ibibio, this means that its logophors are defined to shift, so that they take 
a shiftable context variable, while true Ibibio indexicals are defined as unshiftable: 
they take only the matrix context directly. 


(25) a. [imo]®* = AuTH(c) SHIFTABLE 


b. [isc ]5* = AuTH(c*) NOT SHIFTABLE 
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The actual shifting for the logophor cases is accomplished by the author shifter 
in (21) above. Shiftable logophors will otherwise receive the same interpretation 
as unshiftable logophors; the distinction between the two will emerge only in 
deeply embedded clauses where multiple logophors appear; that is, in precisely 
the complex cases I discuss in this paper. In these examples, shifting logophors 
will obligatorily take the same antecedent (as is the case in Ibibio), while non- 
shifting logophors will not be obligated to take the same antecedent (as in Yoruba, 
Able, and other languages). 

A relevant question at this point is whether Ibibio logophors are merely shifted 
indexicals, or whether there is also logophoric binding. A brief consideration of 
the De Re Blocking Effect indicates that Ibibio logophors are also true logophors, 
involving binding by a logophoric operator. 

The De Re Blocking Effect (Anand 2006) states that a de se pronominal (such 
as a logophor) cannot be c-commanded by a de re pronominal. I illustrate this 
with the following examples from Yoruba (Adesola 2005): 


(26) Yoruba (Adesola 2005) 

a. Adéjso pé oun;ti ri wé réj; 
Ade say that oun PERF see book o-gen 
‘Ade; said that he; has seen his; ; book. 

b. Olujso pé out bàbá dun; 
Olu say thato see father oun-gen 
‘Olu; said that he, ;;; has seen his; father: 

c. Olujso pé baba rè;; ri iyá oun; 
Olu say that father o-gen see mother oun-gen 


‘Olu; said that his; father has seen his; mother’ 


In Anand’s theory of logophoricity, (26b) does not allow the weak pronoun to 
refer to the logophoric center because in cases where it is co-indexed with the 
logophor, it is a competing binder for the more deeply embedded logophor, caus- 
ing a condition B effect. This is ameliorated by interrupting c-command between 
the two pronouns, as in (26c). 

Crucially for determining the status of Ibibio logophors, shifted indexicals do 
not show De Re Blocking Effects, demonstrated in Zazaki by (27). 


Context: At a friend’s party, Hesen is shocked to see Ali, the boyfriend of his 
good friend Rojda, flirting with a woman in a big red dress and hat that obscures 
her face. After seeing her kiss Ali, Hesen rushes off to find Rojda. When he finds 
her, he tells her, “The woman in the big red dress kissed your man.” Of course, it 
was Rojda all along, only hidden under a costume! 
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(27) (Anand 2006: 333) 
Heseni va ks Rojdaa layik tiya pach kerd 
Hesen.oBL said that Rojda.oBr boy your kiss did 


‘Hesen said (to Rojda;) that Rojda; kissed her; man’ 


In the context, Hesen identifies Rojda only with the de re relation “the woman in 
the big red dress”, making the occurance of Rojda in the embedded clause de re, 
while the embedded second person indexical is shifted to refer to Rojda. Despite 
the fact that the de se indexical is c-commanded by the de re name, the sentence 
is felicitous in the context. Anand takes this as evidence that shifted indexicals 
are not operator-bound in the same way that logophors are. 

Therefore, if Ibibio logophors are in fact merely shifted indexicals that happen 
to look like logophoric pronouns, we can expect them to show no De Re Blocking 
Effect, parallel to typical shifted indexicals. However, Ibibio logophors behave 
parallel to Yoruba logophors: 


(28) Ekpe;a-ma  a-bo  keimo;i-ma Lt ete amo; 
Ekpe 3sG-PsT 3sG-say C LOG LOG-PST LOG-see father 3sG.POss 
‘Ekpe; said that he; saw bist father. 


(29) Ekpe;a-ma  a-bo keanye,j;,a-ma  a-kit ete imo; 
Ekpe 3sG-PsT 3sG-say C 3sG 3SG-PST 38G-see father LoG.Poss 
‘Ekpe; said that he, ;; saw his; father’ 

(30) Ekpe;a-ma  a-bo keete  amo,j, a-ma  a-kit eka 
Ekpe 3sG-PsT 3sG-say C father 3sG.POSS 3sG-PST 3sG-see mother 
imo; 

LOG.POSS 


‘Ekpe; said that his, jj. father saw his; mother’® 


According to Anand (2006), the De Re Blocking Effect is due to binding compe- 
tition between the de re pronoun and the logophoric operator in the embedded 
left periphery. Given that Ibibio shows this effect, this is evidence that Ibibio lo- 
gophors are not only shifted, but also logophorically bound. (Along with the fact 
that Ibibio logophors cannot appear in matrix clauses, cf. (2b).) 

This means that Ibibio logophors are sensitive to both a shifting operator and 
a logophoric binding operator, a combination otherwise unattested in the de se 
literature. Moreover, the fact that Ibibio logophors shift while Ibibio indexicals 


*Interestingly, interrupting the c-command relation in Ibibio does not seem to improve this case 
in Ibibio as it does in Yoruba. While this is an interesting distinction between Yoruba and Ibibio 
logophors that bears further investigation, it is orthogonal to the point that Ibibio logophors 
also show the De Re Blocking Effect. 
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do not indicates that there is a conspiracy of factors required for indexical shift: 
Not only is there a shifting operator in the left periphery, but the indexicals of the 
language also have to be lexically sensitive to that shifting operator. This creates 
a new source of typological variance, which I elaborate on in the next section. 


3 Typological implications 


The above discussion of Ibibio logophoricity and its insight into indexical shift 
brings to light additional typological considerations; namely, it is now clear that 
languages can vary with regard to what pronominals are defined as shiftable, 
independent of what shifting operators (if any) are defined for the language. 

This additional parameter only introduces minimal extra typological variance 
however, because for indexical shift to actually occur a language must have both 
shifting operators in its lexicon as well as some pronominal that is defined to 
shift under that operator. Similarly for logophors (as has been implicity assumed 
throughout the literature), a logophoric pronoun by itself is not sufficient for 
logophoric reference; it also must be bound by a logophoric operator and ap- 
propriately related to the attitude holder. Given this conspiracy of factors, the 
actual typology predicted is in Table 2, filled in with languages that (potentially) 
exemplify each typological option. 


Table 2: Typology of logophors and shifted indexicals 


Logophors 


No Logophors Shiftable Unshiftable 
No Shifted Indexicals English Ibibio Ewe, Yoruba? 
Shifted Indexicals Zazaki* Aghem?f 
Amharic? 
Uyghur! 
"Clements (1975); Pearson (2015) 
è Adesola (2005) 
“Anand & Nevins (2004); Anand (2006) 
dSchlenker (2003) 


“Sudo (2012); Shklovsky & Sudo (2014) 
fHyman (1979) 


I have already given examples of most of the languages types predicted, but 
Aghem requires some further comment. As described in Hyman (1979), Aghem 
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might be an example of a language with both logophoric pronouns and shifted 
indexicals: 


(31) Aghem (Hyman 1979: 14) 

a. ?wizin "vo ndzéa win Dä é ggé  lghá wo 
woman that said to him that she/LOG muchlike you 
"The woman said to him that she liked him a lot? 

"The woman said to him “I like you a lot?” 

b. sóogó?'va mé  fifá wò lighámüo, mò wò mbaàgló wi 
soldier that (said) that you like me andyouyet are wife (of) 
bà?tóm'... 
chief 


"The soldier said, “you like me, and yet you are the wife of the 
chief? 


These two examples are the only examples in Hyman (1979) containing both lo- 
gophors and embedded indexicals, or even potentially shifted indexicals at all. 
But to my knowledge Aghem indexicals have not been put through any tests 
to show that they are not quotation or partial quotation, nor are there are sen- 
tences with multiple embedded logophors, so there is no way to tell whether the 
logophoric pronouns behave like shifted indexicals either. Aghem's status as a 
indexical shift and logophoric language is therefore uncertain, but I mention is 
as an area of further investigation. 


4 Conclusion 


In this paper I have described Ibibio logophors and situated them in the typol- 
ogy of de se pronominals cross-linguistically. I have shown that they differ from 
other logophoric pronouns in that two clausemate logophors cannot take sep- 
arate antecedents, but instead must refer together. This behavior, while unlike 
other logophoric languages, is reminiscent of a widely-attested restriction on 
shifted indexicals, which must Shift Together. I account for the Ibibio logophor 
behavior by proposing that they are sensitive to the same indexical shifting op- 
erator that is commonly proposed to account for indexical shift. True indexicals 
in Ibibio, which do not shift, are lexically defined as insensitive to this operator. 
The introduction of lexical sensitivity to shifting operators expands the ty- 
pology of de se pronominals in a restricted way, allowing for the existence of 
languages like Ibibio, where logophors shift but regular indexicals do not, and 
potentially languages where both logophors and indexicals shift, as well as lan- 
guages that have (unshiftable, but bound) logophors and shifted indexicals. 
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Abbreviations 


This paper follows the Leipzig Glossing Rules. Additional abbreviations are: 


AUTH author (or speaker) LOG logophoric pronoun/marker 
ADDR  addressee PERF perfect 

ASP aspect PREP  preposition 

C complementizer PRES present 

CONJ conjunction PST past 
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Control is a phenomenon in which the subject of an embedded clause (the “con- 
trolled argument”) is obligatorily bound by an argument of the immediately embed- 
ding predicate (the “controller”). Cross-linguistic research has revealed variation 
in how control sentences are syntactically instantiated, though no studies to date 
have documented cases where the controlled argument is realized by an overt lo- 
gophoric pronoun. Based on novel field data, we argue that precisely this happens 
in Gengbe (Gbe, Niger-Congo), though only with some embedding verbs (such as 
d3i want) and only when the embedded clause has potential (as opposed to jus- 
sive) mood marking. We propose an account of the facts whereby control comple- 
ments are property-denoting and whereby logophoricity and jussive mood are two 
independent routes for creating property-denoting clauses. The upshot is a view of 
control as an emergent phenomenon; there is no “control construction” or “control 
pronoun" (PRO) but rather several independent components of the grammar that in- 
teract to give rise to control under certain conditions for principled type-theoretic 
reasons. 


1 Introduction 


Control is a phenomenon in which the subject of an embedded clause (the “con- 
trolled argument") is obligatorily bound by an argument of the immediately high- 
er embedding predicate (the “controller”). In (1), for example, the unexpressed 
subject of leave (represented here as PRO) can only be understood as coreferen- 
tial with Bill; it cannot be coreferential with John nor can it take an antecedent 
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linguistics across the disciplines: Selected papers from the 48th Annual Conference on African Lin- 
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outside the sentence. (See e.g. Landau 2013 for a recent survey of the vast theo- 
retical literature on control.) 


(1) (John, said that) Bill, wants [PRO,1,2,,3 to leave]. 


Cross-linguistic research has revealed a fair amount of variation in the syntax 
of control. Diverging from the pattern instantiated by English, some languages 
like Tsez (a Nakh-Daghestanian language spoken in northeast Caucasus) evi- 
dence BACKWARD CONTROL, wherein the controlled argument is overt and the 
controller is covert, as in (2). 


(2) Tsez (Nakh-Daghestanian; Polinsky & Potsdam 2002: 248) 
e [kid-bä ziya b-ièr-a] y-oq-si. 
ILABS girl.II-ERG cOW.ABS 111-feed-INF 11-begin-PST.EVID 


"The girl began to feed the cow: 


Still other languages exhibit copy CONTROL, wherein both the controller and 
the controlled argument are overtly represented, as in San Luis Quiaviní Zapotec, 
illustrated in (3). 


(3) San Luis Quiavini Zapotec (Lee 2003: 102) 
R-cààz | Gye'eihlly [g-auh Gye'eihlly bxaady]. 
HAB-want Mike IRR-eat Mike grasshopper 


‘Mike wants to eat grasshopper: 


Another pattern involves control of an overt anaphor, as found for example in 
Korean and illustrated in (4). 


(4) Korean (Madigan 2008: 237) 
Inho-ka Jwuhi-eykey [caki-ka cip-ey ^ ka-keyss-ko] 
Inho-NoM Jwuhi-paT  self-NoM home-Loc go-vor-coMP 
yaksok-ha-yess-ta. 
promise-do-PST-COMP 


‘Inho promised Jwuhi to go home’ 


Finally, yet another pattern involves control of an overt nominative expression, 
which Szabolcsi (2009) has shown obtains in Hungarian under some conditions, 
as in (5). 
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(5) Hungarian (Szabolcsi 2009: 8) 
Szeretnék [csak én magas lenni]. 
would.like.1sG only I tall  be.InF 


‘I want it to be the case that only I am tall? 


The focus of this paper is yet another pattern which is to our knowledge unat- 
tested in the control literature: control of an overt logophoric pronoun.! We argue 
that this happens in Gengbe, in sentences like (6). 


(6) Gengbe (Elicitation) 
Kofi say Ama want COMP LOG POT eat thing 
‘(Kofi said that) Ama wants to eat. 


In what follows, after providing more background on Gengbe and our data 
collection (§2), we will show that control of logophors in Gengbe obtains only 
with some embedding predicates, and only with potential (as opposed to jussive) 
mood marking (§3). Then, in §4, drawing on relevant theoretical literature, we 
will sketch a formal semantic account of the observed facts wherein control com- 
plements denote properties (Chierchia 1984, Dowty 1985) and wherein there are 
two routes to propertyhood: (i) a logophoric subject (Pearson 2015) or (ii) jus- 
sive mood marking (Zanuttini et al. 2012). The upshot is a view of control as an 
emergent phenomenon; there is no “control construction” or “control pronoun” 
but rather several independent components of the grammar that interact to pro- 
duce control under certain conditions for principled type-theoretic reasons. We 
conclude in $5.? 


2 Background on Gengbe and our data collection 


Gengbe (also known as Gen or Mina) is a Niger-Congo (Kwa) language closely 
related to Ewe and spoken in southern Togo and Benin. According to Ethnologue, 
it has 278,900 speakers worldwide. All Gengbe data reported in this paper were 
collected via elicitation sessions at Indiana University during 2014-2016 with 
Gabriel Mawusi, a native Gengbe speaker from Batonou, Togo. These sessions 
were conducted by Samson Lotven and supported by Professor Samuel Obeng. 


‘Landau (2015), for example, citing Culy (1994), says that overt logophoric pronouns are never 
found in control complements. 

?The core data and analysis presented in this paper are reported also in Grano & Lotven (2018), 
where we focus on the implications of the Gengbe data for theories of mood. 
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3 Core data and puzzles 


We begin with the observation that Gengbe has mood markers that are used to 
express future possibility and deontic necessity, respectively, as illustrated in (7) 
and (8). Following Essegbey (2008) on the cognate Ewe particle a, we label the 
former POT(ENTIAL), and following Ameka (2008) on cognate Ewe particle ne, we 
label the latter JUSS(IVE). 


(7) Akula dà nú. 
Aku pot eat thing 
‘Aku will/might eat’ 


(8) Akáné dù ni. 
Aku juss eat thing 
‘Aku should eat want Aku to eat? 


Against this backdrop, the first puzzle we want to consider is that when em- 
bedded under d3i ‘want’, (9) (with potential marking) results in an unacceptable 
sentence whereas (10) (with jussive marking) results in an acceptable sentence: 


(9) *Ámádszí [bé Akula dà ni]. 
Ama want comp Aku pot eat thing 


Intended: ‘Ama wants Aku to eat: 


(10) Ámádzí [bé Akonë dà nil. 
Ama want comp Aku Juss eat thing 


‘Ama wants Aku to eat. 


At first glance, the puzzle in (9-10) seems familiar enough: in many languages, 
verbs that embed clausal complements can only combine with clauses that bear a 
particular mood. In Romance languages, for example, ‘want’ requires the subjunc- 
tive mood (see e.g. Palmer (2001) for an overview). Hence we might hypothesize 
that in Gengbe, d3i ‘want’ requires jussive mood. 

But additional data reveal that this cannot be the whole story. Gengbe has 
a logophoric pronoun jé which in sentences like (11-12) behaves like other lo- 
gophoric pronouns reported in the literature: it must be anteceded by an atti- 
tude holder, and multiple embedding gives rise to an ambiguity in antecedent 
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choice whereby any c-commanding attitude holder can serve as the antecedent 
(see e.g. Clements 1975; Pearson 2015).? 


(11) Ámá, ká dó é dzí[bé Jéis dù núl. 
Ama be.certain COMP LOG eat thing 


“Ama is certain that she (= Ama) ate 


(12) Kofi, bé Ámá, ká dó é dzí[bé  jéj/5,5 du nú]. 
Kofi say Ama be.certain COMPLOG  eatthing 
‘Kofi said Ama is certain that he/she (= Kofi/Ama) ate 


This leads us to our second puzzle. When jé is embedded under d3i ‘want’, 
both potential and jussive mood become acceptable in the embedded clause, but 
with consequences for antecedent choice. When the potential marker is used, the 
logophor can only be anteceded by the immediately higher subject, instantiating 
a control relation as illustrated in (13), whereas when the jussive marker is used, 
the logophor is obligatorily obviative with respect to the immediately higher 
subject and instead must be bound remotely, as illustrated in (14). 


(13) Kofi, bé Ama, d3i [bé jè1/223 la du nú]. 
Kofi say Ama want COMPLOG POT eat thing 


‘Kofi said Ama wants to eat? CONTROL 


(14) Kofi; bé Ama, dai [bé ;jéj,2,5né dù núl. 
Kofi say Ama want COMP LOG Juss eat thing 


‘Kofi said Ama wants him ( = Kofi) to eat. OBVIATION 


As expected given this second puzzle, if the desire reports in (13-14) appear 
unembedded, the former is grammatical and has an obligatory control interpreta- 
tion (15), whereas the latter is simply ungrammatical (16): the logophor demands 
an antecedent, but the jussive marker forces obviation with respect to the only 
potential antecedent, leading to an irreconcilable conflict. 


(15) Ama, dei [bé jéj,2,1á dù núl. 
Ama want COMP LOG POT eat thing 


‘Ama wants to eat: 


3We use underscores in kí dé é dei ‘be certain’ to signal that it is morphologically complex, 
consisting of a verb kä ‘cut’ and a third-person singular pronoun é flanked by two adpositions 
do ‘at’ and d3i ‘top’. An English gloss more faithful to this underlying structure would be ‘count 
on it that ..". We nonetheless gloss Lä dó é d3i as ‘be.certain’ in the interest of perspicuity. 
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(16) *Âmädzi [bé jè né du ni]. 
Ama want COMP LOG JUSS eat thing 


Intended: 'Ama wants to eat: 


Other attitude predicates that pattern like d3i ‘want’ with respect to this puzzle 
include w3 _súsú ‘intend’ (lit.: ‘do thought’), dzè àgbàgbá ‘try’ (lit.: ‘do ability’), 
l3 ‘agree’, and fjé_dzagbe ‘pledge’. 

In contrast with ‘want’ and the other attitude predicates just mentioned, the 
majority of attitude predicates do not behave likewise with respect to these puz- 
zles; these predicates include ká dé é det ‘be certain’, nd ‘know’, gbl5 ‘say’, and 
Fon drii ‘dream’ (lit.: ‘die dream’). With these attitude predicates, complement 
clauses do not have to contain an overt mood marker (as already illustrated for 
‘be certain’ in (11-12) above). And when they do contain an overt mood marker, 
both potential and jussive mood are compatible with a full-NP subject, as seen 
in (17-18). Furthermore, with a logophoric subject, potential marking gives rise 
to ambiguity in antecedent choice (19) whereas jussive marking patterns like it 
does for ‘want’ in forcing obviation (20). 


(17 Kofibé Ama ká dé é dzí[bé Akula du nil. 
Kofi say Ama be.certain comp Aku pot eat thing 
‘Kofi said Ama is certain that Aku will eat 


(18) Kôfibé Ámáká dé é dzí[bé  Akü né dù ná]. 
Kofi say Ama be.certain comp Aku Juss eat thing 
"Kofi said Ama is certain that Aku should eat? 


(19) Kofi, bé Ámá, ká dó é dzí[bé jéy.14 dù núl. 
Kofi say Ama be.certain COMP LOG POT eat thing 
‘Kofi said Ama is certain that he/she (= Kofi/Ama) will eat 


(20) Kofi, bé Ama, ká dó é dzí[bé  jéj,;né dù núl. 
Kofi say Ama be.certain COMP LOG Juss eat thing 
"Kofi said Ama is certain that he ( = Kofi) should eat? 


See Grano & Lotven (2018) for further discussion and analysis of this class of 
predicates. In what follows, we focus on the class exemplified by ‘want’. 
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4 Toward an account 


In (21), we illustrate a run-of-the-mill modal semantics for want-sentences mod- 
eled after Hintikka’s (1969) influential approach to attitude reports, achieved com- 
positionally via the denotation for want supplied in (22) and revised in (25) be- 
low. Here, WANT(x,w) denotes the set of worlds compatible with x’s desires in 
w. This is no doubt an oversimplified view of the semantics of desire reports for 
reasons discussed in such works as Heim (1992), but it is sufficient for our pur- 
poses, where all that is crucial is that want denotes some kind of relation between 
individuals and propositions. 


(21) [John wants Bill to eat]]” 
= Vw'[w' € WANT(j,w) — EAT(b) in w'] 
~ ‘All those worlds compatible with what John wants in w are worlds in 
which Bill eats: 


(22) [[want]]” = Apr Ax.vw’[w’ € WANT(x,w) — p(w’)] Version 1/2 


When we turn to control sentences like (23), on the other hand, we observe 
that the matrix subject John appears to play two roles semantically, naming both 
the attitude holder and the individual who eats in those worlds compatible with 
the attitude holder’s desires. In other words, it has a denotation like (24).* 


(23) John wants to eat. 


(24) [John wants to eat |" 
= Vw'[w' € WANT(j,w) — EAT(j) in wll 
~ ‘All those worlds compatible with what John wants in w are worlds in 
which John eats: 


Borrowing an insight from Chierchia (1984) and Dowty (1985), we can achieve 
this compositionally with a revised semantics for want as in (25). Here, want 
denotes a relation between individuals and properties. When the individual x 
is plugged in, it values both the attitude holder and the unsaturated argument 
associated with the property. 


(25) [[want]]” = AP« sy Ax.vw' [w' € WANT(x,w) — P(x)(w’)] Version 2/2 


“Something not captured by the denotation in (24) and that we abstract away from since it is 
orthogonal to our purposes is that attitude reports expressed by control sentences have an 
obligatory de se semantics. See Stephenson 2010; Pearson 2015, 2016 for recent approaches. 
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Turning our attention back to Gengbe, we propose that né ‘juss’ contributes 
an individual argument whereas lá ‘pot’ does not. As schematized in (26), this 
means that if the semantic type of an unmarked clause in Genge is proposition- 
denoting (type <st)), a jussive-marked clause is property-denoting (type <e, st)). 
Potential marking, by contrast, has no type-theoretic effect; a potential-marked 
clause is proposition-denoting. 


(26) [Kofi eat]; — [Ax . Kofi Juss eat] ¢¢ st 


a. 
b. [Kofi eat], — [Kofi ror eat] 


This is a natural extension of ideas developed by Portner (2004; 2007); Zanut- 
tini et al. (2012) that imperative clauses (and jussive clauses more generally) are 
property-denoting rather than proposition-denoting. Intuitively, the individual 
argument introduced by the jussive marker can be thought of as the individual 
who bears the responsibility for bringing about the action named by the clause. 
See Grano & Lotven (2018) for further discussion. 

With these proposals in place, the first part of the puzzle is now solved, provid- 
ed Gengbe d3i ‘want’ has the property-theoretic denotation in (25). As schema- 
tized in (27), when ‘want’ combines with a potential-marked clause, the result is a 
type mismatch, because ‘want’ needs to combine with a property-denoting com- 
plement and yet its complement is propositional. When ‘want’ combines with 
a jussive-marked clause, on the other hand, there is no problem, since jussive 
clauses are property-denoting. 


d 


(27) a. Ama wante sty ¢e,styy [Aku POT eat] (5p) <— *! 
b. Ama want s+) ¢e,st)y [Ax.Aku Juss eat] ¢¢ sty < ok! 


In order to solve the second part of the puzzle, we need to say something 
about the semantic analysis of logophors. Following Heim (2002), von Stechow 
(2009; 2003), and Pearson (2015), we adopt the proposal that what distinguishes 
logophoric pronouns from ordinary pronouns is that logophors are obligatorily 
bound by an attitude predicate, thereby creating a derived property for the bind- 
ing attitude predicate to combine with. As schematized in (28), this means that 
if a logophor is embedded under two attitude predicates, it can in principle be 
bound either by the immediately embedding attitude predicate or by the more 
distant one, but if it is not bound by either, the result is ungrammatical.? 


? According to an anonymous reviewer, a logophor can be embedded under a non-attitude verb 
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(28) a. Kofi say [ Ama be.certain [ Ax. [ Loc, eat ] ] ] < ok! 
b. Kofi say [ Ax. [ Ama be.certain [ LOG, eat ] ] ] < ok! 
c. Kofi say [ Ama be.certain [ Loc, eat ] ] <— *! 


This approach to logophors, together with the other proposals already intro- 
duced, are sufficient for solving the second part of the puzzle. To see this, con- 
sider (29-30). In (29), where the complement to ‘want’ has potential marking, 
local binding of the logophor is licit since this will yield the needed property- 
theoretic denotation for the complement to ‘want’, but remote binding is ruled 
out since it results in a proposition-denoting complement to ‘want’. In (30) with 
jussive marking, by contrast, the opposite obtains: local binding of the logophor 
results in a type <e, <e, st)» denotation for the complement to ‘want’, yielding a 
type mismatch, whereas remote binding of the logophor preserves the property- 
denoting status of the complement clause and is hence licit. 


(29) a. Kofi say Ama [want 5,65 [AX. LOG, POT eat] ¢¢ sty ] <— ok! 


re 


b. Kofi say [Ax.Ama want st ¢e sty [LOG POT eat] (57) ] — 


(30) a. Kofi say Ama [want st} <e, styy [AxAy. LOG, JUSSNNN eat] (6 ve sty] <— 


i 


b. Kofi say [Ax.Ama want st deer, [Ay. LOG, JUSSNNN eat] ¢¢ sty ] —— ok! 


as long as it appears in a clause introduced by the complementizer be, as in the following 
example supplied by the reviewer (using Ewe orthography but holding for Gengbe as well, 
according to the reviewer). 


i) Kofi yi Lome be e-a-fle avo. 
y y 
Kofi go Lome comp roc-sugJ-buy cloth 


'Kofi went to Lome to buy cloth: 


We note, however, the following example due to Pearson (2015) showing that logophors in 
Ewe cannot be embedded under a causative predicate even in the presence of the complemen- 
tizer be. 


(ii) Kofiwobe  e/'ye dzo. 
Kofi do comp 3sc/Loc leave 


‘Kofi caused himself to leave. (Pearson 2015:96) 
In light of the asymmetry between (i) and (ii), we hypothesize that (i) may contain a silent 


attitude predicate that licenses the logophor; plausibly it has a meaning like ‘intend’ given that 
the rationale clause in (i) is paraphrasable as ‘with the intention of buying cloth’. 
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Finally, consider what happens when the desire reports in (29-30) are unem- 
bedded. When potential marking is used as in (31), local binding of the logophor 
leads to type-theoretic well-formedness. The only other option is to leave the 
logophor unbound, which leads both to a type mismatch as well as to a violation 
of the constraint that logophors must be bound. When jussive marking is used as 
in (32), on the other hand, local binding of the logophor leads to a type mismatch 
and non-binding of the logophor violates the constraint that logophors must be 


bound. Thus we accurately predict ungrammaticality for such cases.° 

(31 a. Ama [wantye ep <e styy [Ax. LOG, POT eat] (6 sty ] <— ok! 
b. Ama wante st) (esty) [LoG POT eat] 54) €— Si 

(32) a. Ama [wantye sp zess [AXA y. LOG, JUSSNNN eat] ear, ] 1 “l 
b. Ama wante p. (esty [Ay. LOG, JUSSNNN eat] ¢¢ st — *! 


In a nutshell, the observed interaction between mood choice and logophoric 
antecedent choice falls out automatically from familiar, previously proposed 
ideas about the type-theoretic effects of mood markers, logophors, and embed- 
ding verbs.” 


5 Conclusions 


Our central conclusions are twofold. First, on the empirical end, we have argued 
that Gengbe exhibits control of logophors under particular syntactic conditions 
in a way that depends on the choice of the embedding verb and on the mood 


"An anonymous reviewer asks what happens when a non-logophoric pronoun is used in place 
of a logophoric pronoun in configurations in cases like (31-32). Unfortunately, we do not have 
data on this, but we can say a few words about what our theory predicts. Insofar as non- 
logophoric pronouns are distinct from logophoric pronouns in that they are optionally (as 
opposed to obligatorily) bound by attitude predicates, the prediction of our theory is that they 
should be grammatical both with potential marking (since they can be bound) and with jussive 
marking (since they need not be bound). That being said, Pearson (2015: 97) reports variation 
among Ewe speakers in whether they accept bound construals of non-logophoric pronouns. 
Possibly, rejection of a bound construal could be due to a kind of pragmatic blocking effect 
whereby the failure to use a logophor biases an interpreter toward a non-bound interpretation. 
If logophors force a de se construal then such a blocking effect would be nullified by setting up 
a non-de se context so that the logophor would not be a viable alternative; however, a central 
claim of Pearson (2015) is that logophors (at least in Ewe) need not be construed de se, contra 
the received wisdom. 

7 Another consequence of this approach is that ‘be certain’ and other predicates that pattern like 
it need to be type-theoretically flexible in being able to take either a property or a proposition 
as their first argument. See Grano & Lotven (2018) for illustration and further discussion. 
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marking in the clause containing the logophor. On the theoretical end, we have 
shown how to account for the relevant facts in a system whereby independently 
acting parts (verbs, logophors, and mood markers) interact with each other to 
give rise to control in certain combinations for principled type-theoretic reasons. 
As already alluded to in the introduction, one consequence of this system is a 
view of control as an “emergent” phenomenon: no single element in the structure 
of the sentence is responsible for control, it is only their interaction that ends up 
mattering. 

If the type-theoretic principles that form the backbone of our proposal have 
wide cross-linguistic currency, why is control of logophors seemingly so rare? 
We suggest that this is because it depends on the convergence of two features that 
vary independently. First, not all languages have logophors to begin with, so for 
obvious reasons, only those languages that have logophors have the potential for 
control of logophors. Second, in many languages, verbs like ‘want’ that typically 
take control complements also typically tend to take structurally impoverished 
complements that preclude an overt subject. Gengbe, on the other hand, has both 
logophors and the syntax to support a logophor under a verb like ‘want’, thereby 
giving rise to the right conditions for the phenomenon to emerge. 


Abbreviations 

ABS Absolutive JUSS  Jussive 

comp Complementizer LOC Locative 

DAT Dative LOG  Logophor 

ERG Ergative NOM Nominative 

EVID  Evidential POT Potential 

HAB Habitual PST Past 

INF Infinitive voL Volitional 

IRR Irrealis Lise  1st/3rd-person singular 
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Since the hiring of its first Africanist linguist Carleton Hodge in 1964, Indiana Univer- 
sity’s Department of Linguistics has had a strong and continuing presence in the study 
of African languages and linguistics through the work of its faculty and of its graduates 
on the faculties of many other universities. In 1972, the Indiana University Department 
of Linguistics hosted the third Annual Conference of African Linguistics. Proceedings 
of that conference were published by Indiana University Publications (African Series, 
vol. 7). In 1986, IU hosted the seventeenth Annual Conference of African Linguistics 
with Paul Newman and Robert Botne editing the proceedings in a volume entitled Cur- 
rent approaches to African linguistics, vol. 5. In 2016, Indiana University hosted the 48th 
Annual Conference on African Linguistics with the theme African linguistics across the 
disciplines. Proceedings of that meeting are published in this volume. 

The papers presented in this volume reflect the diversity of opportunities for lan- 
guage study in Africa. This collection of descriptive and theoretical work is the fruit of 
data gathering both in-country and abroad by researchers of languages spoken across 
the continent, from Sereer-Siin in the west to Somali in the northeast to Ikalanga in the 
south. The range of topics in this volume is also broad, representative of the varied field 
work in country and abroad that inspires research in African linguistics. This collection 
of papers spans the disciplines of phonology (both segmental and suprasegmental), mor- 
phology (both morphophonological and morphosyntactic), syntax, semantics, and lan- 
guage policy. The data and analyses presented in this volume offer a cross-disciplinary 
view of linguistic topics from the many under-resourced languages of Africa. 
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